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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

ATGCATCCCA TGCATCCCGA AAACCACGCC ATCGCCCGGA GCACGAGCAC CACTAATAAC 60 

CCATCTCGCA GTCGGAGCAG CAGGATGTGG CTCCTGCCCG CCTGGCTGCT CCTCGTCCTG 12 0 

GTGGCCAGCA ATGGCCTGCC AG C AGTCAGA GGCCAGTACC AATCGCCACG TATCATCGAG 18 0 

CATCCCACGG ATCTGGTCGT TAAGAAGAAT GAACCCGCCA CGCTCAACTG CAAAGTGGAG 24 0 

GGCAAGCCGG AACCCACCAT TGAGTGGTTT AAGGATGGCG AACCCGTCAG CACCAACGAA 3 00 

AAGAAATCGC ACCGCGTCCA GTTCAAGGAC GGCGCCCTCT TCTTTTACAG GACAATGCAA 3 60 

GGCAAGAAGG AGCAGGACGG CGGAGAGTAC TGGTGCGTGG CCAAGAACCG AGTGGGC C AG 42 0 

GCCGTTAGTC GCCATGCCTC CCTCCAGATA GCTGTTTTGC GCGACGATTT TCGCGTGGAG 4 80 

CCCAAAGACA CGCGAGTGGC CAAAGGCGAG ACGGCTCTGC TGGAGTGTGG GCCGCCCAAA 54 0 

GGCATTCCAG AGCCAACGCT GATTTGGATA AAGGACGGCG TTCCCTTGGA CGACCTGAAA 60 0 

GCCATGTCGT TTGGCGCCAG CTCCCGCGTT CGAATTGTGG ACGGTGGCAA CCTGCTGATC 66 0 

Jff AG C AATGTGG AGCCCATTGA TGAGGGCAAC TACAAGTGCA TTGCCCAGAA TCTGGTAGGC 72 0 

■yy ACCCGCGAGA GCAGCTATGC CAAGCTGATT GTCCAGGTCA AACCATACTT TATGAAGGAG 7 80 

CCCAAGGATC AGGTGATGCT CTACGGCCAG ACAGCCACTT TCCACTGCTC AGTGGGCGGT 84 0 

t£ GATCCGCCGC CGAAAGTGTT GTGGAAAAAG GAGGAGGGCA ATATTCCGGT GTCCAGAGCG 90 0 

7% Z CGAATCCTTC ACGACGAGAA AAGTTTAGAG ATATCCAACA TAACGCCCAC CGATGAGGGC 96 0 

s " ACCTATGTCT GCGAGGCACA CAACAATGTC GGTCAGATCA GCGCTAGGGC TTCTCTTATA 102 0 

H GTCCACGCTC CGCCGAACTT TACGAAAAGA CCCAGTAACA AGAAAGTGGG ACTAAATGGG 108 0 

L, GTTGTCCAAC TACCTTGCAT GGCCTCCGGA AACCCTCCGC CGTCTGTATT CTGGACCAAG 114 0 

=P GAAGGAGTAT CCACTCTTAT GTTCCCAAAT AGTTCGCACG GAAGGCAGTA TGTGGCTGCC 12 0 0 

?t GATGGAACTC TGCAGATTAC GGATGTGCGG CAGGAAGACG AAGGCTACTA TGTGTGTTCC 12 6 0 

GCTTTCAGTG TAGTCGATTC CTCTACAGTA CGGGTTTTCC TGCAAGTCAG CTCGGTAGAC 132 0 

GAGCGTCCAC CTCCGATTAT TCAAATCGGA CCTGCCAATC AAACACTGCC C AAGGG AT C A 13 8 0 

GTTGCTACTT TACCCTGTCG GGCCACTGGA AATCCCAGTC CCCGTATCAA GTGGTTCCAC 144 0 

GATGGACATG CCGTACAAGC GGGCAATCGA TACAGCATCA TCCAAGGAAG CTCACTGAGA 150 0 

GTCGATGACC TTCAACTAAG TGACTCTGGT ACCTACACCT GCACTGCATC TGGCGAACGA 156 0 

GGAGAAACTT CCTGGGCTGC CACACTAACG GTGGAAAAAC CCGGTTCTAC ATCTCTTCAC 162 0 

CGGGCAGCTG ATC CTAGC AC TTATCCTGCT CCTCCAGGAA CACCTAAAGT CCTGAATGTC 168 0 

AGTCGCACCA GCATTAGTCT TCGTTGGGCT AAAAGCCAAG AGAAACCCGG AGCTGTGGGC 174 0 

CCAATCATTG GAT AC AC TGT AGAGTACTTC AGTCCGGATC TGCAAACTGG TTGGATTGTG 18 0 0 

GCTGCCCATC GAGTCGGCGA CACTCAAGTC ACTATCTCGG GTCTCACTCC TGGCACTTCG 18 60 

TATGTGTTCC TAGTTAGAGC TGAGAATACT CAGGGTATTT CTGTGCCTTC CGGCTTATCA 192 0 

AATGTTATTA AAACCATTGA GGCAGATTTC GATGCAGCTT CTGCCAATGA TTTGTCAGCA 198 0 

GCTCGAACTT TGCTGACAGG AAAGTCGGTG GAGCTAATAG ATGCCTCGGC TATCAATGCT 2 04 0 

AGTGCCGTTA GACTTGAGTG GATGCTCCAC GTGAGCGCTG ATGAGAAATA CGTAGAGGGC 210 0 
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CTGCGCATAC 

ACTATAAGGA 

TGCCAGTGTA 

CCATCCGCAC 

AGTATCACTC 

GATCACTGTT 

2160 

ATGGATGCCT 

CTG CAGAATC 

GTTTGTGGTG 

GGAAACCTTA 

AGAAGTACAC 

CAAGTATGAG 

2220 

TTCTTCCTAA 

CACCCTTTTT 

TGAGACAATT 

GAAGGACAGC 

CCAGTAACTC 

CAAGACAGCC 

2280 

CTCACCTATG 

AAGATGTTCC 

CTCCGCACCA 

C CGGATAAC A 

TTCAGATTGG 

CATGTACAAC 

2340 

CAAACAGCCG 

GTTGGGTGCG 

TTGGACTCCG 

CCACCCTCCC 

AGCACCACAA 

TGGCAATTTG 

2400 

TATGGCTACA 

AGATTGAGGT 

CAGCGCCGGT 

AAC AC C ATG A 

AGGTGCTGGC 

CAATATGACT 

2460 

CTTAATGCTA 

CCACCACATC 

TGTGCTCCTA 

AATAACCTAA 

CCACCGGAGC 

TGTGTACAGC 

2520 

GTGAGGTTGA 

ACTCCTTTAC 

CAAGGCAGGA 

GATGGACCTT 

ACTCCAAACC 

GATATCACTA 

2580 

TTCATGGACC 

CCACCCATCA 

TGTGCATCCG 

CCACGGGCAC 

ATCCAAGCGG 

CACCCATGAT 

2640 

GGGCGACATG 

AGGGACAGGA 

TCTCACGTAT 

CATAACAATG 

GCAACATACC 

ACCTGGCGAC 

2700 

ATTAATCCCA 

CCACTCATAA 

AAAGACCACT 

GACTACCTAT 

CTGGACCGTG 

GCTAATGGTG 

2760 

CTGGTCTGCA 

TCGTTCTTCT 

AGTCCTGGTT 

ATTTCGGCGG 

CTATTTCGAT 

GGTCTACTTC 

2820 

AAGCGCAAGC 

ATCAAATGAC 

CAAGGAATTG 

GGTCACTTAA 

GTGTGGTCAG 

TG AC AAC G AA 

2880 

ATAAC CGC AT 

TAAATATCAA 

TAGCAAAGAG 

AGCCTTTGGA 

TAGACCATCA 

TCGTGGATGG 

2940 

CGAACTGCCG 

ATACTGACAA 

AGACTCAGGA 

TTAAGCGAAT 

CGAAGCTACT 

ATCCCACGTT 

3000 

AAC AG C AGT C 

AATCCAACTA 

CAATAACTCC 

GATGGAGGAA 

CCGATTATGC 

AGAAGTTGAC 

3060 

ACCCGTAACC 

TTACCACCTT 

CTACAATTGT 

CGCAAGAGCC 

CCGATAATCC 

CACGCCGTAC 

3120 

GCCACCACTA 

TGATCATTGG 

TACCTCTTCC 

AGTGAGACCT 

GCACCAAGAC 

AACATCTATA 

3180 

AGTGC CGATA 

AGGACTCGGG 

AACTCATTCG 

CCCTATTCTG 

ACGCATTTGC 

CGGTCAGGTG 

3240 

CCAGCGGTTC 

CTGTTGTCAA 

ATC CAACTAT 

CTTCAGTATC 

CGGTTGAACC 

GATCAACTGG 

3300 

TCAGAGTTTC 

TACCCCCGCC 

GC CAGAAC AC 

CCACCTCCGT 

CTTCTACCTA 

TGGATACGCA 

3360 

CAAGGATCTC 

CTGAATCTTC 

GCGGAAGAGC 

TCCAAAAGCG 

CAGGTTCCGG 

CATTTCTACA 

3420 

AATCAAAGCA 

TTCTGAACGC 

AT C CAT AC AC 

AGCAGCTCCT 

CGGGCGGCTT 

TTCAGCTTGG 

3480 

GGAGTATCGC 

CCCAATATGC 

TGTCGCCTGT 

C C AC CGGAAA 

ACGTTTATAG 

CAATCCGCTG 

3540 

TCGG CAGTGG 

CTGGCGGCAC 

CCAGAACCGC 

TATCAGATAA 

CGCCCACAAA 

CCAACATCCG 

3600 

CCACAGTTAC 

CGGCCTACTT 

TGCCACCACG 

GGTCCAGGAG 

GAGCTGTACC 

ACCCAACCAC 

3660 

PTGCCATTTG 

CCACACAGCG 

TCATGCAGCC 

AG CGAGTAC C 

AGGCTGGACT 

GAATGCAGCG 

3720 

CGATGTGCCC 

AAAGCCGCGC 

CTGCAACAGC 

TGCGATGCCT 

TGGCCACACC 

CTCGCCCATG 

3780 

CAACCCCCAC 

CGCCAGTTCC 

CGTACCCGAG 

GGCTGGTACC 

AACCGGTGCA 

TCCCAATAGC 

3840 

CACCCGATGC 

ACCCGACCTC 

CTCCAACCAC 

CAGATCTACC 

AGTGCTCCTC 

CGAGTGCTCG 

3900 

GATCACTCGA 

GGAGCTCGCA 

GAGTCACAAG 

CGGCAGCTGC 

AGCTCGAGGA 

GCACGGCAGC 

3960 

AGTGCCAAAC 

AACGCGGAGG 

AC AC C AC CG T 

C Cj AC kj ACj (_ U (- 



a fi o n 

^fc VJ z, u 

GAGAGCGAGA 

ACGAGAACAT 

GCTGGCGGAG 

TACGAGCAGC 

GCCAGTACAC 

CAGCGATTGC 

4080 

TGCAATAGCT 

CCCGCGAGGG 

CGACACCTGC 

TCCTGCAGCG 

AGGGATCCTG 

TCTTTACGCC 

4140 

GAGGCGGGCG 

AGCCGGCGCC 

TCGTCAAATG 

ACTGCTAAGA 

ACACCTAA 


4188 


(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 13 95 amino acids. 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
<ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met His Pro Met His Pro Glu Asn His Ala lie Ala Arg Ser Thr Ser 

15 10 15 

Thr Thr Asn Asn Pro Ser Arg Ser Arg Ser Ser Arg Met Trp Leu Leu 

20 25 30 

Pro Ala Trp Leu Leu Leu Val Leu Val Ala Ser Asn Gly Leu Pro Ala 

3 5 4 0 4 5 

Val Arg Gly Gin Tyr Gin Ser Pro Arg lie lie Glu His Pro Thr Asp 

50 55 60 

Leu Val Val Lys Lys Asn Glu Pro Ala Thr Leu Asn Cys Lys Val Glu 
65 70 75 80 

Gly Lys Pro Glu Pro Thr lie Glu Trp Phe Lys Asp Gly Glu Pro Val 

85 90 95 

Ser Thr Asn Glu Lys Lys Ser His Arg Val Gin Phe Lys Asp Gly Ala 

100 105 110 

Leu Phe Phe Tyr Arg Thr Met Gin Gly Lys Lys Glu Gin Asp Gly Gly 

115 120 125 

Glu Tyr Trp Cys Val Ala Lys Asn Arg Val Gly Gin Ala Val Ser Arg 

130 135 140 

His Ala Ser Leu Gin lie Ala Val Leu Arg Asp Asp Phe Arg Val Glu 
145 150 155 160 

Pro Lys Asp Thr Arg Val Ala Lys Gly Glu Thr Ala Leu Leu Glu Cys 

165 170 175 

Gly Pro Pro Lys Gly lie Pro Glu Pro Thr Leu lie Trp lie Lys Asp 

180 185 190 

Gly Val Pro Leu Asp Asp Leu Lys Ala Met Ser Phe Gly Ala Ser Ser 

195 200 205 

Arg Val Arg lie Val Asp Gly Gly Asn Leu Leu lie Ser Asn Val Glu 

210 215 220 

Pro lie Asp Glu Gly Asn Tyr Lys Cys lie Ala Gin Asn Leu Val Gly 
225 230 235 240 

Thr Arg Glu Ser Ser Tyr Ala Lys Leu lie Val Gin Val Lys Pro Tyr 
245 250 255 
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Phe Met Lys Glu Pro Lys Asp Gin Val Met Leu Tyr Gly Gin Thr Ala 

260 265 270 

Thr Phe His Cys Ser Val Gly Gly Asp Pro Pro Pro Lys Val Leu Trp 

275 280 285 

Lys Lys Glu Glu Gly Asn lie Pro Val Ser Arg Ala Arg lie Leu His 

290 295 300 

Asp Glu Lys Ser Leu Glu lie Ser Asn lie Thr Pro Thr Asp Glu Gly 
305 310 315 320 

Thr Tyr Val Cys Glu Ala His Asn Asn Val Gly Gin lie Ser Ala Arg 

325 330 335 

Ala Ser Leu lie Val His Ala Pro Pro Asn Phe Thr Lys Arg Pro Ser 

340 345 350 

Asn Lys Lys Val Gly Leu Asn Gly Val Val Gin Leu Pro Cys Met Ala 

355 360 365 

Ser Gly Asn Pro Pro Pro Ser Val Phe Trp Thr Lys Glu Gly Val Ser 

370 375 380 

Thr Leu Met Phe Pro Asn Ser Ser His Gly Arg Gin Tyr Val Ala Ala 
385 390 395 400 

Asp Gly Thr Leu Gin lie Thr Asp Val Arg Gin Glu Asp Glu Gly Tyr 

405 410 415 

Tyr Val Cys Ser Ala Phe Ser Val Val Asp Ser Ser Thr Val Arg Val 

420 425 430 

Phe Leu Gin Val Ser Ser Val Asp Glu Arg Pro Pro Pro lie lie Gin 

435 440 445 

lie Gly Pro Ala Asn Gin Thr Leu Pro Lys Gly Ser Val Ala Thr Leu 

450 455 460 

Pro Cys Arg Ala Thr Gly Asn Pro Ser Pro Arg lie Lys Trp Phe His 
465 470 475 480 

Asp Gly His Ala Val Gin Ala Gly Asn Arg Tyr Ser lie lie Gin Gly 

485 490 495 

Ser Ser Leu Arg Val Asp Asp Leu Gin Leu Ser Asp Ser Gly Thr Tyr 

500 505 510 

Thr Cys Thr Ala Ser Gly Glu Arg Gly Glu Thr Ser Trp Ala Ala Thr 

515 520 525 

Leu Thr Val Glu Lys Pro Gly Ser Thr Ser Leu His Arg Ala Ala Asp 

530 535 540 

Pro Ser Thr Tyr Pro Ala Pro Pro Gly Thr Pro Lys Val Leu Asn Val 
545 550 555 560 
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Ser Arg Thr Ser lie Ser Leu Arg Trp Ala Lys Ser Gin Glu Lys Pro 

565 570 575 

Gly Ala Val Gly Pro lie lie Gly Tyr Thr Val Glu Tyr Phe Ser Pro 

580 585 590 

Asp Leu Gin Thr Gly Trp lie Val Ala Ala His Arg Val Gly Asp Thr 

595 600 605 

Gin Val Thr lie Ser Gly Leu Thr Pro Gly Thr Ser Tyr Val Phe Leu 

610 615 620 

Val Arg Ala Glu Asn Thr Gin Gly lie Ser Val Pro Ser Gly Leu Ser 
625 630 635 640 

Asn Val lie Lys Thr lie Glu Ala Asp Phe Asp Ala Ala Ser Ala Asn 

645 650 655 

Asp Leu Ser Ala Ala Arg Thr Leu Leu Thr Gly Lys Ser Val Glu Leu 

660 665 670 

lie Asp Ala Ser Ala lie Asn Ala Ser Ala Val Arg Leu Glu Trp Met 

675 680 685 

Leu His Val Ser Ala Asp Glu Lys Tyr Val Glu Gly Leu Arg lie His 

690 695 700 

Tyr Lys Asp Ala Ser Val Pro Ser Ala Gin Tyr His Ser lie Thr Val 
705 710 715 720 

Met Asp Ala Ser Ala Glu Ser Phe Val Val Gly Asn Leu Lys Lys Tyr 

725 730 735 

Thr Lys Tyr Glu Phe Phe Leu Thr Pro Phe Phe Glu Thr lie Glu Gly 

740 745 750 

Gin Pro Ser Asn Ser Lys Thr Ala Leu Thr Tyr Glu Asp Val Pro Ser 

755 760 765 

Ala Pro Pro Asp Asn lie Gin lie Gly Met Tyr Asn Gin Thr Ala Gly 

770 775 780 

Trp Val Arg Trp Thr Pro Pro Pro Ser Gin His His Asn Gly Asn Leu 
785 790 795 800 

Tyr Gly Tyr Lys lie Glu Val Ser Ala Gly Asn Thr Met Lys Val Leu 

805 810 815 

Ala Asn Met Thr Leu Asn Ala Thr Thr Thr Ser Val Leu Leu Asn Asn 

820 825 830 

Leu Thr Thr Gly Ala Val Tyr Ser Val Arg Leu Asn Ser Phe Thr Lys 

835 840 845 

Ala Gly Asp Gly Pro Tyr Ser Lys Pro lie Ser Leu Phe Met Asp Pro 
850 855 860 
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Thr His His Val His Pro Pro Arg Ala His Pro Ser Gly Thr His Asp 
865 870 875 880 

Gly Arg His Glu Gly Gin Asp Leu Thr Tyr His Asn Asn Gly Asn lie 

885 890 895 

Pro Pro Gly Asp lie Asn Pro Thr Thr His Lys Lys Thr Thr Asp Tyr 

900 905 910 

Leu Ser Gly Pro Trp Leu Met Val Leu Val Cys lie Val Leu Leu Val 

915 920 925 

Leu Val lie Ser Ala Ala lie Ser Met Val Tyr Phe Lys Arg Lys His 

930 935 940 

Gin Met Thr Lys Glu Leu Gly His Leu Ser Val Val Ser Asp Asn Glu 
945 950 955 960 

lie Thr Ala Leu Asn lie Asn Ser Lys Glu Ser Leu Trp lie Asp His 

965 970 975 

His Arg Gly Trp Arg Thr Ala Asp Thr Asp Lys Asp Ser Gly Leu Ser 

980 985 990 

Glu Ser Lys Leu Leu Ser His Val Asn Ser Ser Gin Ser Asn Tyr Asn 

995 1000 1005 

Asn Ser Asp Gly Gly Thr Asp Tyr Ala Glu Val Asp Thr Arg Asn Leu 

1010 1015 1020 

Thr Thr Phe Tyr Asn Cys Arg Lys Ser Pro Asp Asn Pro Thr Pro Tyr 
1025 1030 1035 1040 

Ala Thr Thr Met lie lie Gly Thr Ser Ser Ser Glu Thr Cys Thr Lys 

1045 1050 1055 

Thr Thr Ser lie Ser Ala Asp Lys Asp Ser Gly Thr His Ser Pro Tyr 

1060 1065 1070 

Ser Asp Ala Phe Ala Giy Gin Val Pro Ala Val Pro Val Val Lys Ser 

1075 1080 1085 

Asn Tyr Leu Gin Tyr Pro Val Glu Pro lie Asn Trp Ser Glu Phe Leu 

1090 1095 1100 

Pro Pro Pro Pro Glu His Pro Pro Pro Ser Ser Thr Tyr Gly Tyr Ala 
1105 1110 1115 1120 

Gin Gly Ser Pro Glu Ser Ser Arg Lys Ser Ser Lys Ser Ala Gly Ser 

1125 1130 1135 

Gly lie Ser Thr Asn Gin Ser lie Leu Asn Ala Ser lie His Ser Ser 

1140 1145 1150 

Ser Ser Gly Gly Phe Ser Ala Trp Gly Val Ser Pro Gin Tyr Ala Val 
1155 1160 1165 
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Ala Cys Pro Pro Glu Asn Val Tyr Ser Asn Pro Leu Ser Ala Val Ala 

1170 1175 1180 

Gly Gly Thr Gin Asn Arg Tyr Gin lie Thr Pro Thr Asn Gin His Pro 
1185 1190 1195 1200 

Pro Gin Leu Pro Ala Tyr Phe Ala Thr Thr Gly Pro Gly Gly Ala Val 

1205 1210 1215 

Pro Pro Asn His Leu Pro Phe Ala Thr Gin Arg His Ala Ala Ser Glu 

1220 1225 1230 

Tyr Gin Ala Gly Leu Asn Ala Ala Arg Cys Ala Gin Ser Arg Ala Cys 

1235 1240 ^ 1245 

Asn Ser Cys Asp Ala Leu Ala Thr Pro Ser Pro Met Gin Pro Pro Pro 

1250 1255 _ 1260 

Pro Val Pro Val Pro Glu Gly frrp Tyr \Gln Pro Val His Pro Asn Ser 
1265 1270 1275 1280 

y His Pro Met His Pro Thr Ser Ser Asn His Gin lie Tyr Gin Cys Ser 

vj§ 1285 1290 1295 

' s ~* Ser Glu Cys Ser Asp His Ser Arg Ser Ser Gin Ser His Lys Arg Gin 

L. 1300 1305 1310 

N! Leu Gin Leu Glu Glu His Gly Ser Ser Ala Lys Gin Arg Gly Gly His 

ro 

1" 1315 1320 1325 

M His Arg Arg Arg Ala Pro Val Val Gin Pro Cys Met Glu Ser Glu Asn 

[7" 1330 1335 1340 

=p Glu Asn Met Leu Ala Glu Tyr Glu Gin Arg Gin Tyr Thr Ser Asp Cys 

.S3.. 

fi 1345 1350 1355 1360 

Cys Asn Ser Ser Arg Glu Gly Asp Thr Cys Ser Cys Ser Glu Gly Ser 

1365 1370 1375 

Cys Leu Tyr Ala Glu* Ala Gly Glu Pro Ala Pro Arg Gin Met Thr Ala 
1380 1385 1390 

Lys Asn Thr 
1395 


(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 414 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

GGTGAAAATC C ACG CAT CAT CGAGCATCCC ATGGACACGA CGGTGC CAAA AAATGATCCA 60 

TTTACGTTTA ATTGCCAGGC CGAGGGCAAT CCAACACCAA CCATTCAATG GTTTAAGGAC 12 0 

GGTCGCGAAC TGAAGACGGA TACGGGTTCG CATCGCATAA TGCTGCCCGC CGGGGGTCTA 180 

TTCTTTCTCA AGGTTATCCA CTCACGTAGA GAGAGCGATG CGGGCACTTA CTGGTGCGAG 24 0 

GCCAAAAACG AGTTTGGAGT GGCACGGTCC AGGAATGCAA CGTTGCAAGT GGCAGTTCTC 300 

CGCGACGAAT TCCGTTTGGA GCCGGCAAAT ACCCGCGTGG CCCAAGGCGA GGTGGCCCTG 3 60 

ATGGAATGCG GTGCCCCCCG AGGATCTCCG GAGCCGCAAA TCTCGTGGCG CAAGAACGGC 42 0 

CAGACCCTGA ATCTTGTCGG GAACAAGCGG ATTCGCATTG TCGACGGTGG CAATCTGGCC 480 

ATCCAGGAAG CCCGCCAATC GGACGACGGA CGCTACCAGT GTGTGGTCAA GAATGTGGTT 54 0 

GGCACCCGGG AGTCGGCCAC CGCTTTTCTT AAAGTGCATG TACGTCCATT CCTCATCCGA 600 

GGACCCCAGA ATCAGACGGC GGTGGTGGGC AGCTCGGTGG TCTTCCAGTG CCGCATCGGA 660 

GGCGATCCCC TGCCTGATGT CCTGTGGCGA CGCACTGCCT CCGGCGGCAA TATGCCACTG 72 0 

CGTAAGTTTT CTTGGCTTCA TTCAGCTTCA GGTCGTGTGC ACGTACTTGA GGACCGCAGT 780 

CTGAAGCTGG ACGACGTTAC TCTGGAGGAC ATGGGCGAGT ACACTTGCGA GGCGGACAAT 84 0 

GCGGTGGGCG GCATCACGGC CACTGGCATC CTCACCGTTC ACGCTCCCCC CAAATTTGTG 900 

ATACGCCCCA AGAATCAGCT GGTGGAGATC GGTGATGAAG TGCTGTTCGA GTGCCAAGCG 96 0 

AATGGACATC CCCGACCAAC GCTCTACTGG TCGGTGGAGG GCAACAGCTC CCTGCTGCTC 102 0 

CCCGGCTATC GGGATGGCCG CATGGAAGTG ACCCTGACGC CCGAGGGGCG CTCGGTGCTC 10 8 0 

TCGATAGCTC GATTTGCCCG TGAGGATTCC GGAAAGGTGG TCACTTGCAA CGCCCTGAAC 114 0 

GCCGTGGGCA GCGTCAGCAG TCGGACTGTG GTCAGTGTGG ATACGCAATT CGAGCTGC C A 12 0 0 

CCGCCGATTA TCGAACAGGG GCCCGTGAAT CAAACGTTGC CCGTTAAATC AATTGTGGTT 12 60 

CTGCCATGCC GAACTCTGGG CACTCCAGTG CCACAGGTCT CTTGGTACCT GGATGGCATA 132 0 

CCCATCGATG TGCAGGAGCA CGAGCGGCGG AATCTTTCGG ACGCTGGAGC CTTAACCATT 13 8 0 

TCGGATCTTC AGCGCCACGA GGATGAAGGC TTGT AC AC C T GCGTGGCCAG CAATCGCAAC 144 0 

GGAAAATCCT CTTGGAGTGG TTACCTTCGT CTGGACACCC CGACAAATCC GAATATCAAG 15 0 0 

TTCTTCAGAG CCCCAGAACT TTCCACCTAC CCAGGGCCGC CAGGAAAACC GCAAATGGTG 15 60 

GAGAAGGGCG AAAATTCGGT GACTCTCAGC TGGACGAGGA GCAACAAGGT GGGCGGCTCC 162 0 

AGTCTGGTGG GCTATGTAAT CGAGATGTTT GGCAAAAACG AAACGGATGG CTGGGTGGCT 168 0 

GTGGGCACTA GGGTGCAAAA TACCACGTTT AC C C AAACGG GTCTGCTGCC GGGTGTGAAT 174 0 

TACTTCTTTC TAATTCGAGC CGAGAACTCC CATGGCTTAT CACTGCCCAG TCCGATGTCG 1800 

G AAC C C ATT A CGGTGGGAAC GCGCTACTTC AATAGTGGTC TGGATCTGAG CGAGGCTCGT 186 0 

GCCAGTCTGC TGTCCGGAGA TGTTG TGG AG CTGAGCAACG CCAGTGTGGT GGACTCCACT 192 0 

AGCATGAAAC TCACCTGGCA GATCATCAAT GGCAAATACG TCGAGGGCTT CTATGTCTAT 198 0 

GCGAGACAGT TGCCAAATCC AATAGTCAAC AATCCGGCGC CCGTTACTAG CAATACCAAT 2 04 0 

CCGCTGCTGG GCTCTACATC CACATCCGCA TCCGCATCCG CCTCGGCATC GGCATTGATT 210 0 

TCGACAAAGC CAAATATTGC AGCTGCCGGC AAACGTGATG GGGAGACAAA CCAGAGTGGA 2160 

GGAGGAGCTC CGACCCCACT GAACACCAAG TATCGCATGC TAACGATTCT CAATGGCGGT 222 0 
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GGCGCCTCAT CCTGCACCAT CACCGGGCTC GTCCAGTACA CGCTGTATGA ATTTTTCATC 22 8 0 

GTGCCATTTT ACAAATCCGT CGAGGGCAAG CCGTCGAATT CGCGCATCGC TCGCACCCTT 2 34 0 

GAAGATGTTC CCTCTGAGGC ACCATATGGA ATGGAGGCTC TGCTGTTGAA CTCCTCCGCG 24 00 

GTCTTCCTCA AATGGAAGGC ACCAGAACTC AAGGATCGGC ATGGTGTTCT CTTGAACTAT 24 60 

CATGTTATAG TCCGAGGTAT TGACACTGCC CACAATTTCT CACGCATTTT GACAAATGTC 2 52 0 

ACCATCGATG CCGCTTCGCC TACTCTGGTT TTGGCCAATC TCACCGAAGG CGTCATGTAC 25 8 0 

ACCGTGGGCG TGGCGGCCGG AAATAACGCT GGAGTTGGTC CTTATTGTGT CCCAGCTACT 2 64 0 

TTGCGTTTGG ATCCCATCAC AAAGCGACTC GATCCGTTCA TCAATCAGCG GGACCATGTT 2 700 

AACGATGTGC TGACGCAGCC CTGGTTCATA ATACTCCTGG GCGCCATCCT GGCCGTTCTT 2 760 

ATGCTGTCCT TTGGCGCAAT GGTCTTTGTG AAGCGCAAGC ACATGATGAT GAAGCAGTCG 2 82 0 

GCCCTAAATA CAATGCGTGG CAATCACACG AGCGACGTGC TCAAAATGCC GAGTCTATCG 2 880 

GCGCGCAATG GAAACGGCTA CTGGCTGGAC TCCTCCACCG GCGGAATGGT GTGGCGTCCC 2 94 0 

TCGCCCGGCG GCGACTCGCT GGAGATGCAA AAGGATCACA TCGCCGACTA TGCGCCGGTC 3 000 

TGCGGTGCCC CCGGTTCTCC GGCCGGCGGT GGCACCTCTT CCGGTGGATC CGGTGGCGCG 3 060 

GGCAGCGGTG CCAGCGGCGG CGATGACATT CATGGAGGAC ACGGCAGCGA ACGCAATCAG 312 0 

CAGCGGTACG TGGGCGAGTA CTCCAACATA CCGACCGACT ATGCAGAGGT GTC CAGTTTT 3180 

GGCAAGGCAC CCAGCGAGTA TGGTCGGCAT GGCAACGCCT CCCCGGCCCC TTATGCCACC 3 24 0 

TCTTCGATCC TGAGTCCCCA CCAGCAGCAA CAGCAGCAGC AGCCGCGTTA TCAACAGCGA 3 3 00 

CCAGTGCCCG GCTATGGGCT CCAGCGCCCA ATGCACCCAC ACTACCAGCA GCAGCAGCAT 3 3 60 

CAGCAGCAAC AGGCGCAGCA GACGCACCAG CAACACCAGG CTCTCCAGCA GCACCAGCAA 342 0 

CTGCCACCCA GCAACATCTA CCAGCAGATG TCCACCACCA GCGAGATATA CCCCACGAAC 34 8 0 

ACGGGTCCTT CGCGCTCTGT CTACTCTGAG CAGTATTACT ACCCCAAGGA CAAGCAGAGA 3 54 0 

CACATCCACA TCACCGAGAA CAAGCTGAGC AACTGCCACA CCTATGAGGC GGCTCCTGGC 3 60 0 

GCCAAGCAGT CCTCGCCGAT ATCCTCGCAG TTCGCCAGCG TGAGGCGGCA GCAGCTGCCG 3 660 

CCCAACTGCA GCATCGGCAG GGAAAGTGCC CGCTTCAAGG TGCTAAACAC GGATCAGGGC 3 72 0 

AAGAACCAGC AGAATCTCCT GGATCTCGAC GGCTCCTCGA TGTGCTACAA CGGTCTGGCA 3 7 80 

GACTCGGGCT GCGGTGGATC TCCCTCCCCG ATGGCCATGC TGATGTCGCA CGAGGACGAG 3 84 0 

CACGCGCTGT ACCACACGGC GGATGGGGAT CTGGACGACA TGGAACGACT GTACGTCAAG 3 900 

GTGGACGAGC AGCAGCCTCC ACAGCAGCAG CAGCAGCTGA TTCCCCTGGT CCCACAGCAT 3 96 0 

CCGGCGGAAG GTCACCTGCA GTCCTGGCGG AATCAGAGCA CGCGGAGCAG TCGGAAGAAC - 4 02 0 

GGCCAGGAAT GCATCAAGGA ACCCAGCGAG TTGATCTACG CTCCGGGAAG CGTGGCCAGC 4080 

GAACGGAGCC TCCTCAGCAA CTCGGGTAGC GGCACCAGCA GCCAGCCAGC TGGCCACAAT 414 0 

GTCTGA 414 6 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1381 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Gly Glu Asn Pro Arg lie lie Glu His Pro Met Asp Thr Thr Val Pro 

15 10 15 

Lys Asn Asp Pro Phe Thr Phe Asn Cys Gin Ala Glu Gly Asn Pro Thr 

20 25 30 

Pro Thr lie Gin Trp Phe Lys Asp Gly Arg Glu Leu Lys Thr Asp Thr 

35 40 45 

Gly Ser His Arg lie Met Leu Pro Ala Gly Gly Leu Phe Phe Leu Lys 

50 55 60 

Val lie His Ser Arg Arg Glu Ser Asp Ala Gly Thr Tyr Trp Cys Glu 
65 70 75 80 

Ala Lys Asn Glu Phe Gly Val Ala Arg Ser Arg Asn Ala Thr Leu Gin 

85 90 95 

Val Ala Val Leu Arg Asp Glu Phe Arg Leu Glu Pro Ala Asn Thr Arg 

100 105 110 

Val Ala Gin Gly Glu Val Ala Leu Met Glu Cys Gly Ala Pro Arg Gly 

115 ^ 120 125 

Ser Pro Glu Pro Gin lie Ser Trp Arg Lys Asn Gly Gin Thr Leu Asn 

130 135 140 

Leu Val Gly Asn Lys Arg lie Arg lie Val Asp Gly Gly Asn Leu Ala 
145 150 155 160 

lie Gin Glu Ala Arg Gin Ser Asp Asp Gly Arg Tyr Gin Cys Val Val 

165 170 175 

Lys Asn Val Val Gly Thr Arg Glu Ser Ala Thr Ala Phe Leu Lys Val 

180 185 190 

His Val Arg Pro Phe Leu lie Arg Gly Pro Gin Asn Gin Thr Ala Val 

195 200 205 

Val Gly Ser Ser Val Val Phe Gin Cys Arg lie Gly Gly Asp Pro Leu 

210 215 220 

Pro Asp Val Leu Trp Arg Arg Thr Ala Ser Gly Gly Asn Met Pro Leu 
225 230 235 240 

Arg Lys Phe Ser Trp Leu His Ser Ala Ser Gly Arg Val His Val Leu 

245 250 255 

Glu Asp Arg Ser Leu Lys Leu Asp Asp Val Thr Leu Glu Asp Met Gly 
260 265 270 
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Glu Tyr Thr Cys Glu Ala Asp Asn Ala Val Gly Gly lie Thr Ala Thr 

275 * 280 285 

Gly lie Leu Thr Val His Ala Pro Pro Lys Phe Val lie Arg Pro Lys 

290 295 300 

Asn Gin Leu Val Glu lie Gly Asp Glu Val Leu Phe Glu Cys Gin Ala 
305 310 315 320 

Asn Gly His Pro Arg Pro Thr Leu Tyr Trp Ser Val Glu Gly Asn Ser 

325 330 335 

Ser Leu Leu Leu Pro Gly Tyr Arg Asp Gly Arg Met Glu Val Thr Leu 

340 345 350 

Thr Pro Glu Gly Arg Ser Val Leu Ser lie Ala Arg Phe Ala Arg Glu 

355 360 365 

Asp Ser Gly Lys Val Val Thr Cys Asn Ala Leu Asn Ala Val Gly Ser 
370 375 380 

W Val Ser Ser Arg Thr Val Val Ser Val Asp Thr Gin Phe Glu Leu Pro 

,fE 385 390 395 400 

^4 Pro Pro lie lie Glu Gin Gly Pro Val Asn Gin Thr Leu Pro Val Lys 

I . 405 410 415 


Ser lie Val Val Leu Pro Cys Arg Thr Leu Gly Thr Pro Val Pro Gin 

420 425 430 

Val Ser Trp Tyr Leu Asp Gly lie Pro He Asp Val Gin Glu His Glu 

435 440 445 

Arg Arg Asn Leu Ser Asp Ala Gly Ala Leu Thr He Ser Asp Leu Gin 

450 455 460 

Arg His Glu Asp Glu Gly Leu Tyr Thr Cys Val Ala Ser Asn Arg Asn 
465 470 475 480 

Gly Lys Ser Ser Trp Ser Gly Tyr Leu Arg Leu Asp Thr Pro Thr Asn 

485 490 495 

Pro Asn He Lys Phe Phe Arg Ala Pro Glu Leu Ser Thr Tyr Pro Gly 

500 505 J 510 

Pro Pro Gly Lys Pro Gin Met Val Glu Lys Gly Glu Asn Ser Val Thr 

515 520 525 

Leu Ser Trp Thr Arg Ser Asn Lys Val Gly Gly Ser Ser Leu Val Gly 

530 535 540 

Tyr Val He Glu Met Phe Gly Lys Asn Glu Thr Asp Gly Trp Val Ala 
545 550 555 560 

Val Gly Thr Arg Val Gin Asn Thr Thr Phe Thr Gin Thr Gly Leu Leu 
565 570 575 
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Pro Gly Val Asn Tyr Phe Phe Leu lie Arg Ala Glu Asn Ser His Gly 

580 585 590 

Leu Ser Leu Pro Ser Pro Met Ser Glu Pro lie Thr Val Gly Thr Arg 

595 600 605 

Tyr Phe Asn Ser Gly Leu Asp Leu Ser Glu Ala Arg Ala Ser Leu Leu 

610 615 620 

Ser Gly Asp Val Val Glu Leu Ser Asn Ala Ser Val Val Asp Ser Thr 
625 630 635 640 

Ser Met Lys Leu Thr Trp Gin lie lie Asn Gly Lys Tyr Val Glu Gly 

645 650 655 

Phe Tyr Val Tyr Ala Arg Gin Leu Pro Asn Pro lie Val Asn Asn Pro 

660 665 670 

Ala Pro Val Thr Ser Asn Thr Asn Pro Leu Leu Gly Ser Thr Ser Thr 

675 680 685 

Ser Ala Ser Ala Ser Ala Ser Ala Ser Ala Leu lie Ser Thr Lys Pro 

690 695 700 

Asn lie Ala Ala Ala Gly Lys Arg Asp Gly Glu Thr Asn Gin Ser Gly 
705 710 715 720 

Gly Gly Ala Pro Thr Pro Leu Asn Thr Lys Tyr Arg Met Leu Thr lie 

725 730 735 

Leu Asn Gly Gly Gly Ala Ser Ser Cys Thr lie Thr Gly Leu Val Gin 

740 745 750 

Tyr Thr Leu Tyr Glu Phe Phe lie Val Pro Phe Tyr Lys Ser Val Glu 

755 760 765 

Gly Lys Pro Ser Asn Ser Arg lie Ala Arg Thr Leu Glu Asp Val Pro 

770 775 780 

Ser Glu Ala Pro Tyr Gly Met Glu Ala Leu Leu Leu Asn Ser Ser Ala 
785 . 790 J 795 800 

Val Phe Leu Lys Trp Lys Ala Pro Glu Leu Lys Asp Arg His Gly Val 

805 810 815 

Leu Leu Asn Tyr His Val lie Val Arg Gly lie Asp Thr Ala His Asn 

820 825 830 

Phe Ser Arg lie Leu Thr Asn Val Thr lie Asp Ala Ala Ser Pro Thr 

835 840 845 

Leu Val Leu Ala Asn Leu Thr Glu Gly Val Met Tyr Thr Val Gly Val 

850 855 860 

Ala Ala Gly Asn Asn Ala Gly Val Gly Pro Tyr Cys Val Pro Ala Thr 
865 870 875 880 
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Leu Arg Leu Asp Pro lie Thr Lys Arg Leu Asp Pro Phe lie Asn Gin 

885 890 895 

Arg Asp His Val Asn Asp Val Leu Thr Gin Pro Trp Phe He He Leu 

900 905 910 

Leu Gly Ala He Leu Ala Val Leu Met Leu Ser Phe Gly Ala Met Val 

915 920 925 

Phe Val Lys Arg Lys His Met Met Met Lys Gin Ser Ala Leu Asn Thr 

930 935 940 

Met Arg Gly Asn His Thr Ser Asp Val Leu Lys Met Pro Ser Leu Ser 
945 950 955 960 

Ala Arg Asn Gly Asn Gly Tyr Trp Leu Asp Ser Ser Thr Gly Gly Met 

965 970 975 

Val Trp Arg Pro Ser Pro Gly Gly Asp Ser Leu Glu Met Gin Lys Asp 

980 985 990 

His He Ala Asp Tyr Ala Pro Val Cys Gly Ala Pro Gly Ser Pro Ala 

995 1000 1005 

Gly Gly Gly Thr Ser Ser Gly Gly Ser Gly Gly Ala Gly Ser Gly Ala 

1010 1015 1020 

Ser Gly Gly Asp Asp He His Gly Gly His Gly Ser Glu Arg Asn Gin 
1025 1030 1035 1040 

Gin Arg Tyr Val Gly Glu Tyr Ser Asn He Pro Thr Asp Tyr Ala Glu 

1045 1050 1055 

Val Ser Ser Phe Gly Lys Ala Pro Ser Glu Tyr Gly Arg His Gly Asn 

1060 1065 1070 

Ala Ser Pro Ala Pro Tyr Ala Thr Ser Ser lie Leu Ser Pro His Gin 

1075 1080 1085 

Gin Gin Gin Gin Gin Gin Pro Arg Tyr Gin Gin Arg Pro Val Pro Gly 

1090 1095 1100 

Tyr Gly Leu Gin Arg Pro Met His Pro His Tyr Gin Gin Gin Gin His 
1105 1110 1115 1120 

Gin Gin Gin Gin Ala Gin Gin Thr His Gin Gin His Gin Ala Leu Gin 

1125 1130 1135 

Gin His Gin Gin Leu Pro Pro Ser Asn He Tyr Gin Gin Met Ser Thr 

1140 1145 1150 

Thr Ser Glu He Tyr Pro Thr Asn Thr Gly Pro Ser Arg Ser Val Tyr 

1155 1160 1165 

Ser Glu Gin Tyr Tyr Tyr Pro Lys Asp Lys Gin Arg His He His He 
1170 1175 1180 
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Thr Glu Asn Lys Leu Ser Asn Cys His Thr Tyr Glu Ala Ala Pro Gly 
1185 1190 1195 1200 

Ala Lys Gin Ser Ser Pro lie Ser Ser Gin Phe Ala Ser Val Arg Arg 

1205 1210 1215 

Gin Gin Leu Pro Pro Asn Cys Ser lie Gly Arg Glu Ser Ala Arg Phe 

1220 1225 1230 

Lys Val Leu Asn Thr Asp Gin Gly Lys Asn Gin Gin Asn Leu Leu Asp 

1235 1240 1245 

Leu Asp Gly Ser Ser Met Cys Tyr Asn Gly Leu Ala Asp Ser Gly Cys 

1250 1255 1260 

Gly Gly Ser Pro Ser Pro Met Ala Met Leu Met Ser His Glu Asp Glu 
1265 1270 1275 1280 

His Ala Leu Tyr His Thr Ala Asp Gly Asp Leu Asp Asp Met Glu Arg 

1285 1290 1295 

Leu Tyr Val Lys Val Asp Glu Gin Gin Pro Pro Gin Gin Gin Gin Gin 
1300 1305 1310 

j Leu lie Pro Leu Val Pro Gin His Pro Ala Glu Gly His Leu Gin Ser 
1315 1320 1325 

Trp Arg Asn Gin Ser Thr Arg Ser Ser Arg Lys Asn Gly Gin Glu Cys 

1330 1335 1340 

lie Lys Glu Pro Ser Glu Leu lie Tyr Ala Pro Gly Ser Val Ala Ser 
1345 1350 1355 1360 

Glu Arg Ser Leu Leu Ser Asn Ser Gly Ser Gly Thr Ser Ser Gin Pro 

1365 1370 1375 

Ala Gly His Asn Val 
1380 


(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
ATGTACTATC TAGGTTTTTA CCACACTCAC ACACACACAC ACACATACAT AAATTTTGAT 60 
AAAATTCCTA ATGCCTCAAA TCTCGCTCCC GTGATAATCG AACATCCCAT CGATGTGGTG 12 0 

GTATCTAGGG GATCGCCAGC AACCCTCAAC TGTGGTGCAA AGCCATCTAC CGCCAAAATC 180 
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ACATGGTACA AGGATGGACA GCCCGTAATC ACGAATAAGG AGCAAGTGAA CAGCCACCGG 24 0 

ATTGTTCTCG ACACGGGATC CCTGTTTCTT CTGAAAGTGA ATAGTGGAAA AAACGGAAAA 3 00 

GACAGCGATG CGGGAGCGTA CTATTGTGTG GCCAGCAACG AGCACGGAGA AGTGAAGTCG 3 60 

AACGAAGGAT CGTTAAAATT GGCGATGCTT CGCGAAGACT TTCGAGTTCG GCCAAGAACA 42 0 

GTTCAGGCTC TTGGTGGAGA GATGGCCGTT CTGGAATGCA GTCCGCCACG TGGATTCCCG 4 80 

GAGCCGGTTG TGAGCTGGCG GAAAGACGAC AAAGAGCTCC GAATTCAAGA CATGCCACGA 54 0 

TACACTCTAC ACTCTGACGG AAACCTCATC ATTGATCCGG TCGATCGAAG CGATTCTGGT 60 0 

ACTTATCAGT GTGTTGCCAA CAACATGGTC GGAGAACGGG TGTCCAATCC CGCAAGATTG 66 0 

AGTGTCTTTG AGAAACCAAA GTTTGAGCAA GAACCCAAGG ACATGACGGT CGACGTCGGA 72 0 

GCCGCAGTGC TGTTTGATTG TCGTGTGACT GGAGATCCTC AACCACAAAT TACGTGGAAA 78 0 

CGCAAAAATG AGCCGATGCC AGTTACACGT GCATACATTG CCAAGGATAA TCGGGGGTTG 84 0 

AGAAT CG AAA GAGTTCAACC ATCAGACGAA GGTGAATACG TTTGCTATGC ACGAAATCCA 90 0 

GCGGGAACTC TTGAAGCATC TGCACATCTT CC3TGTCCAGG CACCTCCATC CTTCCAGACA 96 0 

AAACCAGCAG ACCAGTCAGT TCCAGCTGGA GGCACGGCAA CTTTTGAATG CACCTTGGTC 102 0 

GGTCAACCGA GTCCCGCCTA TTTTTGGAGC AAGGAAGGCC AACAGGATCT TCTTTTCCCA 10 8 0 

AGTTATGTGT CCGCTGATGG TAGAACGAAA ,GTTTCACCAA CTGGAACATT GACAATTGAG 114 0 

GAAGTTCGTC AAGTTGATGA GGGAGCTTAT GTGTGCGCTG GAATGAACTC GGCAGGAAGC 12 0 0 

TCGTTGAGCA AGGCAGCTTT GAAAGCAACA TTTGAAAC C A AAGGCCGTGT CCAAAAAAAA 12 6 0 

AAGAGCAAAA TGGGCAAACA GAAACAAAAA AATGTTCAAT CAATTATCAA ATATTTAATT 13 2 0 

TCAGCCGTGA CCGGAAACAC ACCCGCCAAA CCACCACCAA CAATCGAGCA TGGTCATCAA " 13 8 0 

AATCAGACCC TTATGGTTGG ATCATCAGCC ATCCTTCCAT GTCAGGCTAG CGGAAAACCA 144 0 

ACTCCAGGAA TATCATGGCT CAGGGATGGG CTACCTATTG ACATTACAGA TAGTCGTATC 15 0 0 

AGTCAACATT CAACGGGAAG TCTACATATT GCCGATTTAA AGAAACCTGA CACCGGAGTT 15 6 0 

TACACTTGCA TTGCGAAGAA CGAGGATGGA GAGTCAACAT GGTCGGCATC TCTGACTGTT 162 0 

GAAGATCACA CTAGCAATGC ACAATTTGTT CGGATGCCGG ATCCATCGAA CTTCCCGTCT 16 8 0 

TCTCCAACGC AACCCATTAT TGTCAATGTC ACTGATACCG AAGTAGAGCT CCACTGGAAT 174 0 

GCTCCCTCCA CATCTGGCGC AGGACCAATC ACTGGTTATA TCATTCAGTA CTACAGTCCA 18 0 0 

GACCTCGGAC AGACGTGGTT TAACATTCCA GACTACGTGG CATCTACTGA ATATAGAATA 186 0 

AAGGGTCTGA AACCATCTCA CTCGTATATG TTTGTGATTC GAGCAGAAAA TGAGAAAGGT 192 0 

ATTGGAACGC CGAGTGTGTC GTCGGCTCTC GTTACCACTA GCAAGCCAGC AGCTCAAGTT 198 0 

GCGCTTTCTG ACAAGAACAA AATGGACATG GCCATCGCTG AGAAGAGACT CACTTCGGAA 2 04 0 

CAACTCATAA AACTCGAGGA AGTGAAGACT ATTAATTCTA CGGCCGTTCG TTTGTTCTGG 210 0 

AAGAAGAGGA AACTTGAAGA GCTGATTGAT GGTTACTACA TCAAGTGGAG AGGGCCTCCA 2160 

AGAACCAATG ATAATCAATA CGTGAATGTG ACCAGCCCTA GCACCGAAAA CTATGTTGTT 22 2 0 

TCAAATTTAA TGCCATTCAC CAACTATGAG TTTTTCGTGA TTC CTTATCA TTCCGGAGTT 22 8 0 

CATAGTATTC ATGG AG C AC C GAGTAATTCC ATGGACGTGT TG AC CG C CG A AGCTCCACCT 2 340 

TCATTGCCAC CAGAGGATGT GCGAATCCGT ATGCTCAACC TGACCACTCT TCGTATCTCT 24 0 0 

TGGAAAGCAC CAAAAGCCGA CGGCATCAAC GGAATTCTCA AAGGATTCCA AATTGTTATT 2 4 60 
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GTTGGTCAAG CGCCCAACAA CAATCGGAAC ATCACTACAA ACGAGAGAGC TGCCAGTGTT 2 52 0 

ACTCTGTTCC ATTTAGTGAC TGGAATGACG TATAAAATTC GTGTAGCGGC TAGAAGCAAT 2 58 0 

GGTGGAGTTG GAGTCTCACA TGGAACGAGT GAAGTCATCA TGAATCAAGA CACGCTGGAA 2 64 0 

AAACACCTTG CTGCTCAACA AGAAAACGAA TCATTTTTGT ATGGGCTGAT CAATAAATCT 2 700 

CATGTTCCTG TGATTGTCAT TGTTGCAATT CTGATTATTT TCGTAGTCAT CATTAT AG C C 2760 

TATTGTTACT GGAGGAATAG CAGAAACAGT GATGGAAAGG ATCGAAGTTT TATAAAGATC 2 82 0 

AATGATGGAA GTGTTCATAT GGCTTCGAAT AATCTTTGGG ATGTTGCACA AAATCCGAAT 2 880 

CAGAATCCAA TGTACAACAC TGCTGGAAGA ATGACTATGA ACAATAGAAA TGGCCAGGCT 2 94 0 

CTCTATTCGC TG AC AC C AAA TGCGCAAGAC TTTTTCAACA ATTGTGATGA CTACAGTGGA 3 000 

ACGATGCACA GACCAGGATC CGAGCATCAC TATCATTATG CTCAACTGAC TGGCGGACCT 3 060 

GGTAATGCGA TGTCTACTTT TTATGGAAAC CAATATCACG ATGATCCATC TCCATATGCC 312 0 

ACCACAACAC TGGTCCTGTC GAACCAACAA CCAGCTTGGC TCAATGACAA AATGCTTCGC 318 0 

GCGCCAGCAA TGCCAACAAA TCCCGTGCCA CCAGAGCCAC CGGCGCGATA TGCAGATCAT 3 24 0 

ACCGCTGGAA GACGATCTCG ATCGAGCCGT GCATCCGATG GGAGAGGAAC TCTGAATGGC 33 0 0 

GGACTCCATC AC C GG AC TAG CGGAAGTCAA CGGTCGGATA GTCCACCTCA CACAGATGTG 33 6 0 

AGCTATGTTC AGCTTCACTC ATCCGATGGA ACTGGTAGTA GTAAGGAAAG AACTGGGGAG 34 2 0 

CGGAGAACAC CACCGAATAA GACTCTGATG GACTTTATTC CGCCACCACC TTCCAATCCA 34 8 0 

CCACCACCTG GAGGGCACGT TTATGACACA GCAACTAGGC GTCAGTTGAA TCGTGGAAGT 354 0 


ACTCCACGAG AAGACACCTA CGATTCGGTC AGTGACGGAG CTTTTGCTCG GGTTGATGTG 3 60 0 

ru 

AATGCAAGGC CAACGAGTCG GAATCGGAAT TTGGGAGGAA GGCCGCTGAA AGGGAAACGA 3 66 0 


GACGACGATA GTCAGCGGTC TTCGTTGATG ATGGACGATG ATGGTGGATC TTCTGAAGCT 37 2 0 

GACGGGGAGA ACTCTGAAGG AGACGTTCCG CGTGGAGGTG TTAGAAAAGC AGTTCCTCGA 3 78 0 

ATGGGT AT C T CTGCAAGTAC GCTGGCTCAT AGTTGTTACG GGACAAACGG CACTGCTCAA 3 84 0 

CGATTCCGGT CAATTCCACG TAACAATGGA ATCGTCACAC AAGAACAAAC TTGA 3 894 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Met Tyr Tyr Leu Gly Phe Tyr His Thr His Thr His Thr His Thr Tyr 

15 10 15 

lie Asn Phe Asp Lys lie Pro Asn Ala Ser Asn Leu Ala Pro Val lie 

20 25 30 

lie Glu His Pro lie Asp Val Val Val Ser Arg Gly Ser Pro Ala Thr 
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Leu Asn Cys Gly Ala 
50 

Asp Gly Gin Pro Val 
65 

lie Val Leu Asp Thr 
85 

Lys Asn Gly Lys Asp 
100 

Asn Glu His' Gly Glu 
115 

Met Leu Arg Glu Asp 
130 

Gly Gly Glu Met Ala 
145 

Glu Pro Val Val Ser 
165 

Asp Met Pro Arg Tyr 
180 

Pro Val Asp Arg Ser 
195 

Met Val Gly Glu Arg 
210 

Lys Pro Lys Phe Glu 
225 

Ala Ala Val Leu Phe 
245 

lie Thr Trp Lys Arg 
260 

lie Ala Lys Asp Asn 
275 

Asp Glu Gly Glu Tyr 
290 

Glu Ala Ser Ala His 
305 

Lys Pro Ala Asp Gin 
325 

Cys Thr Leu Val Gly 


Lys Pro Ser Thr Ala 
55 

lie Thr Asn Lys Glu 

70 . 

Gly Ser Leu Phe Leu 
90 

Ser Asp Ala Gly Ala 
105 

Val Lys Ser Asn Glu 
120 

Phe Arg Val Arg Pro 
135 

Val Leu Glu Cys Ser 
150 

Trp Arg Lys Asp Asp 
170 

Thr Leu His Ser Asp 
185 

Asp Ser Gly Thr Tyr 
200 

Val Ser Asn Pro Ala 
215 

Gin Glu Pro Lys Asp 
230 

Asp Cys Arg Val Thr 
250 

Lys Asn Glu Pro Met 
265 

Arg Gly Leu Arg lie 
280 

Val Cys Tyr Ala Arg 
295 

Leu Arg Val Gin Ala 
310 

Ser Val Pro Ala Gly 
330 

Gin Pro Ser Pro Ala 
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Lys lie Thr Trp Tyr 
60 

Gin Val Asn Ser His 
75 

Leu Lys Val Asn Ser 
95 

Tyr Tyr Cys Val Ala 
110 

Gly Ser Leu Lys Leu 
125 

Arg Thr Val Gin Ala 
140 

Pro Pro Arg Gly Phe 
155 

Lys Glu Leu Arg lie 
175 

Gly Asn Leu lie lie 
190 

Gin Cys Val Ala Asn 
205 

Arg Leu Ser Val Phe 
220 

Met Thr Val Asp Val 
235 

Gly Asp Pro Gin Pro 
255 

Pro Val Thr Arg Ala 
270 

Glu Arg Val Gin Pro 
285 

Asn Pro Ala Gly Thr 
300 

Pro Pro Ser Phe Gin 
315 

Gly Thr Ala Thr Phe 
335 

Tyr Phe Trp Ser Lys 


Lys 

Arg 

80 

Gly 

Ser 

Ala 

Leu 

Pro 
160 
Gin 

Asp 

Asn 

Glu 

Gly 
240 
Gin 

Tyr 

Ser 

Leu 

Thr 
320 
Glu 

Glu 
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340 345 350 

Gly Gin Gin Asp Leu Leu Phe Pro Ser Tyr Val Ser Ala Asp Gly Arg 

355 360 365 

Thr Lys Val Ser Pro Thr Gly Thr Leu Thr He Glu Glu Val Arg Gin 

370 375 380 

Val Asp Glu Gly Ala Tyr Val Cys Ala Gly Met Asn Ser Ala Gly Ser 
385 390 395 400 

Ser Leu Ser Lys Ala Ala Leu Lys Ala Thr Phe Glu Thr Lys Gly Arg 

405 410 415 

Val Gin Lys Lys Lys Ser Lys Met Gly Lys Gin Lys Gin Lys Asn Val 

420 425 430 

Gin Ser He He Lys Tyr Leu He Ser Ala Val Thr Gly Asn Thr Pro 

435 440 445 

Ala Lys Pro Pro Pro Thr He Glu His Gly His Gin Asn Gin Thr Leu 

450 455 460 

Met Val Gly Ser Ser Ala He Leu Pro Cys Gin Ala Ser Gly Lys Pro 
465 470 475 480 

Thr Pro Gly He Ser Trp Leu Arg Asp Gly Leu Pro He Asp He Thr 

485 490 495 

Asp Ser Arg He Ser Gin His Ser Thr Gly Ser Leu His He Ala Asp 
M 500 505 510 

IT Leu Lys Lys Pro Asp Thr Gly Val Tyr Thr Cys He Ala Lys Asn Glu 

J! 515 520 525 

f# Asp Gly Glu Ser Thr Trp Ser Ala Ser Leu Thr Val Glu Asp His Thr 

530 535 540 

Ser Asn Ala Gin Phe Val Arg Met Pro Asp Pro Ser Asn Phe Pro Ser 
545 550 555 560 

Ser Pro Thr Gin Pro He He Val Asn Val Thr Asp Thr Glu Val Glu 

565 570 575 

Leu His Trp Asn Ala Pro Ser Thr Ser Gly Ala Gly Pro He Thr Gly 

580 585 590 

Tyr He He Gin Tyr Tyr Ser Pro Asp Leu Gly Gin Thr Trp Phe Asn 

595 600 605 

He Pro Asp Tyr Val Ala Ser Thr Glu Tyr Arg He Lys Gly Leu Lys 

610 615 620 

Pro Ser His Ser Tyr Met Phe Val He Arg Ala Glu Asn Glu Lys Gly 
625 630 635 640 

He Gly Thr Pro Ser Val Ser Ser Ala Leu Val Thr Thr Ser Lys Pro 
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645 650 655 

Ala Ala Gin Val Ala Leu Ser Asp Lys Asn Lys Met Asp Met Ala lie 

660 665 670 

Ala Glu Lys Arg Leu Thr Ser Glu Gin Leu lie Lys Leu Glu Glu Val 

675 680 685 

Lys Thr lie Asn Ser Thr Ala Val Arg Leu Phe Trp Lys Lys Arg Lys 

690 695 700 

Leu Glu Glu Leu lie Asp Gly Tyr Tyr lie Lys Trp Arg Gly Pro Pro 
705 710 715 720 

Arg Thr Asn Asp Asn Gin Tyr Val Asn Val Thr Ser Pro Ser Thr Glu 

725 730 735 

Asn Tyr Val Val Ser Asn Leu Met Pro Phe Thr Asn Tyr Glu Phe Phe 

740 745 750 

Val lie Pro Tyr His Ser Gly Val His Ser lie His Gly Ala Pro Ser 

755 760 765 

Asn Ser Met Asp Val Leu Thr Ala Glu Ala Pro Pro Ser Leu Pro Pro 

770 775 780 

Glu Asp Val Arg lie Arg Met Leu Asn Leu Thr Thr Leu Arg lie Ser 
785 790 795 800 

Trp Lys Ala Pro Lys Ala Asp Gly lie Asn Gly lie Leu Lys Gly Phe 

805 810 815 

Gin lie Val lie Val Gly Gin Ala Pro Asn Asn Asn Arg Asn lie Thr 

820 825 830 

Thr Asn Glu Arg Ala Ala Ser Val Thr Leu Phe His Leu Val. Thr Gly 

835 ' 840 845 

Met Thr Tyr Lys lie Arg Val Ala Ala Arg Ser Asn Gly Gly Val Gly 

850 855 860 

Val Ser His Gly Thr Ser Glu Val lie Met Asn Gin Asp Thr Leu Glu 
865 870 875 880 

Lys His Leu Ala Ala Gin Gin Glu Asn Glu Ser Phe Leu Tyr Gly Leu 

885 890 895 

lie Asn Lys Ser His Val Pro Val lie Val lie Val Ala lie Leu. lie 

900 905 910 

lie Phe Val Val lie lie lie Ala Tyr Cys Tyr Trp Arg Asn Ser Arg 

915 920 925 

Asn Ser Asp Gly Lys Asp Arg Ser Phe lie Lys lie Asn Asp Gly Ser 

930 935 940 

Val His Met Ala Ser Asn Asn Leu Trp Asp Val Ala Gin Asn Pro Asn 
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945 950 955 960 

Gin Asn Pro Met Tyr Asn Thr Ala' Gly Arg Met Thr Met Asn Asn Arg 

965 970 975 

Asn Gly Gin Ala Leu Tyr Ser Leu Thr Pro Asn Ala Gin Asp Phe Phe 

980 985 990 

Asn Asn Cys Asp Asp Tyr Ser Gly Thr Met His Arg Pro Gly Ser Glu 

995 1000 1005 

His His Tyr His Tyr Ala Gin Leu Thr Gly Gly Pro Gly Asn Ala Met 

1010 1015 1020 

Ser Thr Phe Tyr Gly Asn Gin Tyr His Asp Asp Pro Ser Pro Tyr Ala 
1025 1030 1035 1040 

Thr Thr Thr Leu Val Leu Ser Asn Gin Gin Pro Ala Trp Leu Asn Asp 

1045 1050 1055 

Lys Met Leu Arg Ala Pro Ala Met Pro Thr Asn Pro Val Pro Pro Glu 

1060 1065 1070 

Pro Pro Ala Arg Tyr Ala Asp His Thr Ala Gly Arg Arg Ser Arg Ser 

1075 1080 1085 

Ser Arg Ala Ser Asp Gly Arg Gly Thr Leu Asn Gly Gly Leu His His 

1090 1095 1100 

Arg Thr Ser Gly Ser Gin Arg Ser Asp Ser Pro Pro His Thr Asp Val 
1105 1110 1115 1120 

Ser Tyr Val Gin Leu His Ser Ser Asp Gly Thr Gly Ser Ser Lys Glu 

1125 1130 1135 

Arg Thr Gly Glu Arg Arg Thr Pro Pro Asn Lys Thr Leu Met Asp Phe 

1140 1145 1150 

lie Pro Pro Pro Pro Ser Asn Pro Pro Pro Pro Gly Gly His Val Tyr 

1155 1160 1165 

Asp Thr Ala Thr Arg Arg Gin Leu Asn Arg Gly Ser Thr Pro Arg Glu 

1170 1175 1180 

Asp Thr Tyr Asp Ser Val Ser Asp Gly Ala Phe Ala Arg Val Asp Val 
1185 1190 1195 1200 

Asn Ala Arg Pro Thr Ser Arg Asn Arg Asn Leu Gly Gly Arg Pro Leu 

1205 1210 1215 

Lys Gly Lys Arg Asp Asp Asp Ser Gin Arg Ser Ser Leu Met Met Asp 

1220 1225 1230 

Asp Asp Gly Gly Ser Ser Glu Ala Asp Gly Glu Asn Ser Glu Gly Asp 

1235 1240 1245 

Val Pro Arg Gly Gly Val Arg Lys Ala Val Pro Arg Met Gly lie Ser 
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1250 1255 1260 

Ala Ser Thr Leu Ala His Ser Cys Tyr Gly Thr Asn Gly Thr Ala Gin 
1265 1270 1275 1280 

Arg Phe Arg Ser lie Pro Arg Asn Asn Gly lie Val Thr Gin Glu Gin 

1285 f 1^90 1295 

Thr ; 

(2) INFORMATION FOR SEQ ID NO : 7 : 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4956 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

ATGAAATGGA AACATGTTCC TTTTTTGGTC ATGATATCAC TCCTCAGCTT ATCCCCAAAT 60 

CACCTGTTTC TGGCCCAGCT TATTC CAGAC CCTGAAGATG TAGAGAGGGG GAACGACCAC 12 0 

GGGACGCCAA TCCCCAC^C^^^^ACGAT GACAATTCGC TGGGCTATAC AGGCTCCCGT 18 0 

CTTCGTCAGG AAGATTTTCC ACCTCGCATT GTTGAACACC CTTCAGACCT GATTGTCTCA 24 0 

AAAGGAGAAC CTGCAACTTT GAACTGCAAA GCTGAAGGCC GCCCCACACC CACTATTGAA 300 

TGGTACAAAG GGGGAGAGAG AGTGGAGACA GACAAAGATG ACCCTCGCTC AC AC CGAATG 3 60 

TTGCTGCCGA GTGGATCTTT ATTTTTCTTA CGTATAGTAC ATGGACGGAA AAGTAGACCT 42 0 

GATGAAGGAG TCTATGTCTG TGTAGCAAGG AATTACCTTG GAGAGGCTGT GAGCCACAAT 480 

GCATCGCTGG AAGTAGC CAT ACTTCGGGAT GACTT CAGAC AAAACCCTTC GGATGTCATG 54 0 

GTTGCAGTAG GAGAGCCTGC AGTAATGGAA TGCCAACCTC CACGAGGCCA TCCTGAGCCC 600 

ACCATTTCAT GGAAGAAAGA TGGCTCTCCA CTGGATGATA AAGATGAAAG AATAACTATA ' 660 

CGAGGAGGAA AGCTCATGAT CACTTACACC CGTAAAAGTG ACGCTGGCAA ATATGTTTGT 72 0 

GTTGGTACCA ATATGGTTGG GGAACGTGAG AGTGAAGTAG CCGAGC TGAC TGTCTTAGAG 7 80 

AGACCATCAT TTGTGAAGAG ACCCAGTAAC TTGGCAGTAA CTGTGGATGA CAGTGCAGAA 84 0 

TTTAAATGTG AGGCCCGAGG TGACC CTGTA CCTACAGTAC GATGGAGGAA AGATGATGGA 900 

GAGCTGCCCA AATCCAGATA TGAAATCCGA GATGATCATA CCTTGAAAAT TAGGAAGGTG 960 

ACAGCTGGTG ACATGGGTTC ATACACTTGT GTTGCAGAAA ATATGGTGGG CAAAGCTGAA 102 0 

GCATCTGCTA CTCTGACTGT TCAAGAACCT CCACATTTTG TTGTGAAACC CCGTGACCAG 10 80 

GTTGTTGCTT TGGGACGGAC TGTAACTTTT CAGTGTGAAG CAACCGGAAA TCCTCAACCA 114 0 

GCTATTTTCT GGAGGAGAGA AGGGAGTCAG AATCTACTTT TCTCATATCA ACCACCACAG 12 0 0 

TCATCCAGCC GATTTTCAGT CTCCCAGACT GGCGACCTCA CAATTACTAA TGTCCAGCGA 12 60 

TCTGATGTTG GTTATTACAT CTGCCAGACT TTAAATGTTG CTGGAAGCAT CATCACAAAG 132 0 

GCATATTTGG AAGTTACAGA TGTGATTGCA GATCGGCCTC CCCCAGTTAT TCGACAAGGT 13 80 
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CCTGTGAATC AGACTGTAGC CGTGGATGGC ACTTTCGTCC TCAGCTGTGT GGCCACAGGC 144 0 

AGTCCAGTGC CCACCATTCT GTGGAGAAAG GATGGAGTCC TCGTTTCAAC CCAAGACTCT 1500 

CGAATCAAAC AGTTGGAGAA TGGAGTACTG CAGATCCGAT ATGCTAAGCT GGGTGATACT 15 60 

GGTCGGTACA CCTGCATTGC ATCAACCCCC AGTGGTGAAG CAACATGGAG TGCTTACATT 1620 

GAAGTTCAAG AATTTGGAGT TCCAGTTCAG CCTCCAAGAC CTACTGACCC AAATTTAATC 168 0 

CCTAGTGCCC CATCAAAACC TGAAGTGACA GATGTCAGCA GAAATACAGT CACATTATCG 174 0 

TGGCAACCAA ATTTGAATTC AGGAGCAACT CCAACATCTT ATATTATAGA AGCCTTCAGC 18 00 

CATGCATCTG GTAGCAGCTG GCAGACCGTA GCAGAGAATG TGAAAACAGA AACATCTGCC 1860 . 

ATTAAAGGAC TCAAACCTAA TGCAATTTAC CTTTTCCTTG TGAGGGCAGC TAATGCATAT 192 0 

GGAATTAGTG ATCCAAGCCA AATATCAGAT CCAGTGAAAA CACAAGATGT CCTACCAACA 19 80 

AGTCAGGGGG TGGACCACAA GCAGGTCCAG AGAGAGCTGG GAAATGCTGT TCTGCACCTC 2 04 0 

CACAACCCCA CCGTCCTTTC TTCCTCTTCC ATCGAAGTGC ACTGGACAGT AGATCAACAG 2100 

TCTCAGTATA TACAAGGATA TAAAATTCTC TATCGGCCAT CTGGAGCCAA C C ACGG AGAA 216 0 

TCAGACTGGT TAGTTTTTGA AGTGAGGACG CCAGCCAAAA ACAGTGTGGT AATCCCTGAT 2 22 0 

CTCAGAAAGG GAGTCAACTA TGAAATTAAG GCTCGCCCTT TTTTTAATGA ATTTCAAGGA 22 8 0 

GCAGATAGTG AAATCAAGTT TGCCAAAACC CTGGAAGAAG CACCCAGTGC CCCACCCCAA 2 34 0 

GGTGTAACTG TATCCAAGAA TGATGGAAAC GGAACTGCAA TTCTAGTTAG TTGGCAGCCA 24 0 0 

CCTCCAGAAG ACACTCAAAA TGGAATGGTC CAAGAGTATA AGGTTTGGTG TCTGGGCAAT 2 460 

GAAACTCGAT ACCACATCAA CAAAACAGTG GATGGTTCCA CCTTTTCCGT GGTCATTCCC 2 52 0 

TTTCTTGTTC CTGGAATCCG ATACAGTGTG GAAGTGGCAG CCAGCACTGG GGCTGGGTCT 2 58 0 

GGGGTAAAGA GTGAGCCTCA GTTCATCCAG CTGGATGCCC ATGGAAACCC TGTGTCACCT 2 64 0 

GAGGACCAAG TCAGCCTCGC TC AG CAG ATT T C AG ATGTGG TGAAGCAGCC GGCCTTCATA 2 70 0 

GCAGGTATTG GAGCAGCCTG TTGGATCATC CTCATGGTCT TCAGCATCTG GCTTTATCGA 2 760 

CACCGCAAGA AGAGAAACGG ACTTACTAGT ACCTACGCGG GTATCAGAAA AGTCCCGTCT 2 82 0 

TTTACCTTCA CACCAACAGT AACTTACCAG AGAGGAGGCG AAGCTGTCAG CAGTGGAGGG 2 8 80 

AGGCCTGGAC TTCTCAACAT CAGTGAACCT GCCGCGCAGC CATGGCTGGC AGACACGTGG 2 94 0 

CCTAATACTG GCAACAACCA CAATGACTGC TCCATCAGCT GCTGCACGGC AGGCAATGGA 3 00 0 

AACAGCGACA GCAACCTCAC TACCTACAGT CGCCCAGCTG ATTGTATAGC AAATTATAAC 3 06 0 

AACCAACTGG ATAACAAACA AACAAATCTG ATGCTCCCTG AGTCAACTGT TTATGGTGAT 312 0 

GTGGACCTTA GTAACAAAAT CAATGAGATG AAAACCTTCA ATAGCCCAAA TCTGAAGGAT 318 0 

GGGCGTTTTG TCAATCCATC AGGGCAGCCT ACTCCTTACG CCACCACTCA GCTCATCCAG 3 24 0 

TCAAACCTCA GCAACAACAT GAACAATGGC AGCGGGGACT CTGGCGAGAA GCACTGGAAA 3 300 

CCACTGGGAC AGCAGAAACA AGAAGTGGCA CCAGTTCAGT ACAACATCGT GGAGCAAAAC 3 3 60 

AAGCTGAACA AAGATTATCG AGCAAATGAC ACAGTTCCTC CAACTATCCC AT AC AAC C AA 3 42 0 

TCATACGACC AGAACACAGG AGGATCCTAC AACAGCTCAG ACCGGGGCAG TAGTACATCT 3 480 

GGGAGTCAGG GGCACAAGAA AGGGGCAAGA ACACCCAAGG TACCAAAACA GGGTGGCATG 3 54 0 

AACTGGGCAG ACCTGCTTCC TCCTCCCCCA GCACATCCTC CTCCACACAG CAATAGCGAA 3 600 

GAGTACAACA TTTCTGTAGA TGAAAGCTAT GACCAAGAAA TGCCATGTCC CGTGCCACCA 3 660 
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GCAAGGATGT 

ATTTGCAACA 

AGATGAATTA 

GAAGAGGAGG 

AAGATGAACG 

AGGCCCCACT 

3720 

CCCCCTGTTC 

GGGG AG C AG C 

TTCTTCTCCA 

GCTGCCGTGT 

CCTATAGCCA 

TCAGTCCACT 

3780 

GCCACTCTGA 

CTCCCTCCCC 

ACAGGAAGAA 

CTCCAGCCCA 

TGTTACAGGA 

TTGTCCAGAG 

3840 

GAGACTGGCC 

ACATGCAGCA 

CCAGCCCGAC 

AGGAGACGGC 

AGCCTGTGAG 

TCCTCCTCCA 

3900 

CCACCACGGC 

CGATCTCCCC 

TCCACATACC 

TATGGCTACA 

TTTCAGGACC 

CCTGGTCTCA 

3960 

GATATGGATA 

CGGATGCGCC 

AGAAGAGGAA 

GAAGACGAAG 

CCGACATGGA 

GGTAGCCAAG 

4020 

ATGCAAACCA 

GAAGGCTTTT 

GTTACGTGGG 

CTTGAGCAGA 

CACCTGCCTC 

CAGTGTTGGG 

4080 

GACCTGGAGA 

GCTCTGTCAC 

GGGGTCCATG 

ATCAACGGCT 

GGGGCTCAGC 

CTCAGAGGAG 

4140 

GACAACATTT 

CCAGCGGACG 

CTCCAGTGTT 

AGTTCTTCGG 

ACGGCTCCTT 

TTTCACTGAT 

4200 

GCTGACTTTG 

CCCAGGCAGT 

CGCAGCAGCG 

GCAGAGTATG 

CTGGTCTGAA 

AGTAGCACGA 

4260 

CGGCAAATGC 

AGGATGCTGC 

TGGCCGTCGA 

CATTTTCATG 

CGTCTCAGTG 

CCCTAGGCCC 

4320 

ACAAGTCCCG 

TGTCTACAGA 

CAGCAACATG 

AGTGCCGCCG 

TAATGCAGAA 

AACCAGACCA 

4380 

GCCAAGAAAC 

TGAAACACCA 

GCCAGGACAT 

CTGCGCAGAG 

AAACCTACAC 

AGATGATCTT 

4440 

CCACCACCTC 

CTGTGCCGCC 

AC CTGCTATA 

AAGTCACCTA 

CTGCCCAATC 

CAAGACACAG 

4500 

C TGGAAGT AC 

GACCTGTAGT 

GGTGCCAAAA 

CTCCCTTCTA 

TGGATGCAAG 

AACAGACAGA 

4560 

TCATCAGACA 

GAAAAGGAAG 

C AG TT AC AAG 

GGGAGAGAAG 

TGTTGGATGG 

AAGACAGGTT 

4620 

GTTGACATGC 

GAACAAATCC 

AGGTGATCCC 

AGAGAAGCAC 

AGGAACAGCA 

AAATGACGGG 

4680 

AAAGGACGTG 

GAAACAAGGC 

AGCAAAACGA 

GACCTTCCAC 

CAGCAAAGAC 

TCATCTCATC 

4740 

C AAG AG GAT A 

TTCTACCTTA 


7\ r ir PT"T r rr^f~ 1 ZiZ\ 
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AG AAlC AAp C A 

4800 

CCCAGTTCCT 

CAAGCTCAAT 

GTCATCAAGA 

GGATCAGGAA 

GCAGACAAAG 

4860 

AATGTAGGTC 

GAAGAAATAT 

TGCAGAAATG 

CAGGTACTTG 

GAGGATATGA 

AAGAGGAGAA 

4920 

GATAATAATG 

AAGAATTAGA 

GGAAACTGAA 

AGCTGA 



4956 


iS (2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1651 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Met Lys Trp Lys His Val Pro Phe Leu Val Met lie Ser Leu Leu Ser 

15 10 15 

Leu Ser Pro Asn His Leu Phe Leu Ala Gin Leu lie Pro Asp Pro Glu 

20 25 30 

Asp Val Glu Arg Gly Asn Asp His Gly Thr Pro lie Pro Thr Ser Asp 

3 5 4 0 45 

Asn Asp Asp Asn Ser Leu Gly Tyr Thr Gly Ser Arg Leu Arg Gin Glu 
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ill 


Mi- 


50 55 60 

Asp Phe Pro Pro Arg lie Val Glu His Pro Ser Asp Leu lie Val Ser 
65 70 75 80 

Lys Gly Glu Pro Ala Thr Leu Asn Cys Lys Ala Glu Gly Arg Pro Thr 

85 90 95 

Pro Thr lie Glu Trp Tyr Lys Gly Gly Glu Arg Val Glu Thr Asp Lys 

100 105 110 

Asp Asp Pro Arg Ser His Arg Met Leu Leu Pro Ser Gly Ser Leu Phe 

115 120 125 

Phe Leu Arg lie Val His Gly Arg Lys Ser Arg Pro Asp Glu Gly Val 

130 135 140 

Tyr Val Cys Val Ala Arg Asn Tyr Leu Gly Glu Ala Val Ser His Asn 
145 150 155 160 

Ala Ser Leu Glu Val Ala lie Leu Arg Asp Asp Phe Arg Gin Asn Pro 

165 170 175 

Ser Asp Val Met Val Ala Val Gly Glu Pro Ala Val Met Glu Cys Gin 

180 185 190 

Pro Pro Arg Gly His Pro Glu Pro Thr lie Ser Trp Lys Lys Asp Gly 

195 200 205 

Ser Pro Leu Asp Asp Lys Asp Glu Arg lie Thr lie Arg Gly Gly Lys 

210 215 220 

Leu Met lie Thr Tyr Thr Arg Lys Ser Asp Ala Gly Lys Tyr Val Cys 
225 230 235 240 

Val Gly Thr Asn Met Val Gly Glu Arg Glu Ser Glu Val Ala Glu Leu 

245 250 255 

Thr Val Leu Glu Arg Pro Ser Phe Val Lys Arg Pro Ser Asn Leu Ala 

260 265 270 

Val Thr Val Asp Asp Ser Ala Glu Phe Lys Cys Glu Ala Arg Gly Asp 

275 280 285 

Pro Val Pro Thr Val Arg Trp Arg Lys Asp Asp Gly Glu Leu Pro Lys 

290 295 300 

Ser Arg Tyr Glu lie Arg Asp Asp His Thr Leu Lys lie Arg Lys Val 
305 310 315 320 

Thr Ala Gly Asp Met Gly Ser Tyr Thr Cys Val Ala Glu Asn Met Val 

325 330 335 

Gly Lys Ala Glu Ala Ser Ala Thr Leu Thr Val Gin Glu Pro Pro His 

340 345 350 

Phe Val Val Lys Pro Arg Asp Gin Val Val Ala Leu Gly Arg Thr Val 
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355 360 365 

Thr Phe Gin Cys Glu Ala Thr Gly Asn Pro Gin Pro Ala lie Phe Trp 

370 375 380 

Arg Arg Glu Gly Ser Gin Asn Leu Leu Phe Ser Tyr Gin Pro Pro Gin 
385 390 395 400 

Ser Ser Ser Arg Phe Ser Val Ser Gin Thr Gly Asp Leu Thr lie Thr 

405 410 415 

Asn Val Gin Arg Ser Asp Val Gly Tyr Tyr lie Cys Gin Thr Leu Asn 

420 425 430 

Val Ala Gly Ser lie lie Thr Lys Ala Tyr Leu Glu Val Thr Asp Val 

435 440 445 

lie Ala Asp Arg Pro Pro Pro Val lie Arg Gin Gly Pro Val Asn Gin 

450 455 460 

Thr Val Ala Val Asp Gly Thr Phe Val Leu Ser Cys Val Ala Thr Gly 
465 470 475 480 

Ser Pro Val Pro Thr lie Leu Trp Arg Lys Asp Gly Val Leu Val Ser 

485 490 495 

Thr Gin Asp Ser Arg lie Lys Gin Leu Glu Asn Gly Val Leu Gin lie 

500 505 510 

Arg Tyr Ala Lys Leu Gly Asp Thr Gly Arg Tyr Thr Cys lie Ala Ser 

515 520 525 

Thr Pro Ser Gly Glu Ala Thr Trp Ser Ala Tyr lie Glu Val Gin Glu 

530 535 540 

Phe Gly Val Pro Val Gin Pro Pro Arg Pro Thr Asp Pro Asn Leu lie 
545 550 555 560 

Pro Ser Ala Pro Ser Lys Pro Glu Val Thr Asp Val Ser Arg Asn Thr 

565 570 575 

Val Thr Leu Ser Trp Gin Pro Asn Leu Asn Ser Gly Ala Thr Pro Thr 

580 585 590 

Ser Tyr lie lie Glu Ala Phe Ser His Ala Ser Gly Ser Ser Trp Gin 

595 600 605 

Thr Val Ala Glu Asn Val Lys Thr Glu Thr Ser Ala lie Lys Gly Leu 

610 615 620 

Lys Pro Asn Ala lie Tyr Leu Phe Leu Val Arg Ala Ala Asn Ala Tyr 
625 630 635 640 

Gly lie Ser Asp Pro Ser Gin lie Ser Asp Pro Val Lys Thr Gin Asp 

645 650 655 

Val Leu Pro Thr Ser Gin Gly Val Asp His Lys Gin Val Gin Arg Glu 
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660 665 670 

Leu Gly Asn Ala Val Leu His Leu His Asn Pro Thr Val Leu Ser Ser 

675 680 685 

Ser Ser lie Glu Val His Trp Thr Val Asp Gin Gin Ser Gin Tyr lie 

690 695 / 700 

Gin Gly Tyr Lys lie Leu Tyr Arg Pro Ser Gly Ala Asn His Gly Glu 
705 710 715 720 

Ser Asp Trp Leu Val Phe Glu Val Arg Thr Pro Ala Lys Asn Ser Val 

725 730 735 

Val lie Pro Asp Leu Arg Lys Gly Val Asn Tyr Glu lie Lys Ala Arg 

740 745 750 

Pro Phe Phe Asn Glu Phe Gin Gly Ala Asp Ser Glu lie Lys Phe Ala 

755 760 765 

Lys Thr Leu Glu Glu Ala Pro Ser Ala Pro Pro Gin Gly Val Thr Val 

770 775 780 

Ser Lys Asn Asp Gly Asn Gly Thr Ala lie Leu Val Ser Trp Gin Pro 
785 790 795 800 

Pro Pro Glu Asp Thr Gin Asn Gly Met Val Gin Glu Tyr Lys Val Trp 

805 810 815 

Cys Leu Gly Asn Glu Thr Arg Tyr His lie Asn Lys Thr Val Asp Gly 

820 825 830 

Ser Thr Phe Ser Val Val lie Pro Phe Leu Val Pro Gly lie Arg Tyr 

835 840 845 

Ser Val Glu Val Ala Ala Ser Thr Gly Ala Gly Ser Gly Val Lys Ser 

850 855 860 

Glu Pro Gin Phe lie Gin Leu Asp Ala His Gly Asn Pro Val Ser Pro 
865 870 875 880 

Glu Asp Gin Val Ser Leu Ala Gin Gin lie Ser Asp Val Val Lys Gin 

885 890 895 

Pro Ala Phe lie Ala Gly lie Gly Ala Ala Cys Trp lie lie Leu Met 

900 905 910 

Val Phe Ser lie Trp Leu Tyr Arg His Arg Lys Lys Arg Asn Gly Leu 

915 920 925 

Thr Ser Thr Tyr Ala Gly lie Arg Lys Val Pro Ser Phe Thr Phe Thr 

930 935 940 

Pro Thr Val Thr Tyr Gin Arg Gly Gly Glu Ala Val Ser Ser Gly Gly 
945 950 955 960 

Arg Pro Gly Leu Leu Asn lie Ser Glu Pro Ala Ala Gin Pro Trp Leu 
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965 970 975 

Ala Asp Thr Trp Pro Asn Thr Gly Asn Asn His Asn Asp Cys Ser He 

980 985 990 

Ser Cys Cys Thr Ala Gly Asn Gly Asn Ser Asp Ser Asn Leu Thr Thr 

995 1000 1005 

Tyr Ser Arg Pro Ala Asp Cys He Ala Asn Tyr Asn Asn Gin Leu Asp 

1010 1015 1020 

Asn Lys Gin Thr Asn Leu Met Leu Pro Glu Ser Thr Val Tyr Gly Asp 
1025 1030 1035 1040 

Val Asp Leu Ser Asn Lys He Asn Glu Met Lys Thr Phe Asn Ser Pro 

1045 1050 1055 

Asn Leu Lys Asp Gly Arg Phe Val Asn Pro Ser Gly Gin Pro Thr Pro 

1060 1065 1070 

Tyr Ala Thr Thr Gin Leu He Gin Ser Asn Leu Ser Asn Asn Met Asn 
q 1075 1080 1085 

0J Asn Gly Ser Gly Asp Ser Gly Glu Lys His Trp Lys Pro Leu Gly Gin 

CI. 1090 1095 1100 

M= Gin Lys Gin Glu Val Ala Pro Val Gin Tyr Asn He Val Glu Gin Asn 

Ln 1105 1110 1115 1120 

FU Lys Leu Asn Lys Asp Tyr Arg Ala Asn Asp Thr Val Pro Pro Thr He 


1125 1130 1135 

Pro Tyr Asn Gin Ser Tyr Asp Gin Asn Thr Gly Gly Ser Tyr Asn Ser 

1140 1145 1150 

Ser Asp Arg Gly Ser Ser Thr Ser Gly Ser Gin Gly His Lys Lys Gly 

1155 1160 1165 

Ala Arg Thr Pro Lys Val Pro Lys Gin Gly Gly Met Asn Trp Ala Asp 

1170 1175 1180 

Leu Leu Pro Pro Pro Pro Ala His Pro Pro Pro His Ser Asn Ser Glu 
1185 1190 1195 1200 

Glu Tyr Asn He Ser Val Asp Glu Ser Tyr Asp Gin Glu Met Pro Cys 

1205 1210 1215 

Pro Val Pro Pro Ala Arg Met Tyr Leu Gin Gin Asp Glu Leu Glu Glu 

1220 1225 1230 

Glu Glu Asp Glu Arg Gly Pro Thr Pro Pro Val Arg Gly Ala Ala Ser 

1235 1240 1245 

Ser Pro Ala Ala Val Ser Tyr Ser His Gin Ser Thr Ala Thr Leu Thr 

1250 1255 1260 

Pro Ser Pro Gin Glu Glu Leu Gin Pro Met Leu Gin Asp Cys Pro Glu 
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1265 1270 1275 1280 

Glu Thr Gly His Met Gin His Gin Pro Asp Arg Arg Arg Gin Pro Val 

1285 1290 1295 

Ser Pro Pro Pro Pro Pro Arg Pro lie Ser Pro Pro His Thr Tyr Gly 

1300 1305 1310 

Tyr lie Ser Gly Pro Leu Val Ser Asp Met Asp Thr Asp Ala Pro Glu 

1315 1320 1325 

Glu Glu Glu Asp Glu Ala Asp Met Glu Val Ala Lys Met Gin Thr Arg 

1330 1335 1340 

Arg Leu Leu Leu Arg Gly Leu Glu Gin Thr Pro Ala Ser Ser Val Gly 
1345 1350 1355 1360 

Asp Leu Glu Ser Ser Val Thr Gly Ser Met lie Asn Gly Trp Gly Ser 

1365 1370 1375 

Ala Ser Glu Glu Asp Asn He Ser Ser Gly Arg Ser Ser Val Ser Ser 
p 1380 1385 1390 

4f Ser Asp Gly Ser Phe Phe Thr Asp Ala Asp Phe Ala Gin Ala Val Ala 

Q 1395 1400 1405 

Ala Ala Ala Glu Tyr Ala Gly Leu Lys Val Ala Arg Arg Gin Met Gin 
1410 1415 1420 

fU Asp Ala Ala Gly Arg Arg His Phe His Ala Ser Gin Cys Pro Arg Pro 

1425 1430 1435 1440 

Thr Ser Pro Val Ser Thr Asp Ser Asn Met Ser Ala Ala Val Met Gin 
? !l 1445 1450 1455 

yi Lys Thr Arg Pro Ala Lys Lys Leu Lys His Gin Pro Gly His Leu Arg 

^ 1460 1465 1470 

Arg Glu Thr Tyr Thr Asp Asp Leu Pro Pro Pro Pro Val Pro Pro Pro 

1475 1480 1485 ' 

Ala He Lys Ser Pro Thr Ala Gin Ser Lys Thr Gin Leu Glu Val Arg 

1490 1495 1500 

Pro Val Val Val Pro Lys Leu Pro Ser Met Asp Ala Arg Thr Asp Arg 
1505 1510 1515 1520 

Ser Ser Asp Arg Lys Gly Ser Ser Tyr Lys Gly Arg Glu Val Leu Asp 

1525 1530 1535 

Gly Arg Gin Val Val Asp Met Arg Thr Asn Pro Gly Asp Pro Arg Glu 

1540 1545 1550 

Ala Gin Glu Gin Gin Asn Asp Gly Lys Gly Arg Gly Asn Lys Ala Ala 

1555 • 1560 1565 

Lys Arg Asp Leu Pro Pro Ala Lys Thr His Leu He Gin Glu Asp He 
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1570 1575 1580 

Leu Pro Tyr Cys Arg Pro Thr Phe Pro Thr Ser Asn Asn Pro Arg Asp 
1585 1590 1595 1600 

Pro Ser Ser Ser Ser Ser Met Ser Ser Arg Gly Ser Gly Ser Arg Gin 

1605 1610 1615 

Arg Glu Gin Ala Asn Val Gly Arg Arg Asn lie Ala Glu Met Gin Val 

1620 1625 1630 

Leu Gly Gly Tyr Glu Arg Gly Glu Asp Asn Asn Glu Glu Leu Glu Glu 

1635 1640 1645 

Thr Glu Ser 
1650 


(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1300 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: CDNA 
(ix) FEATURE: 

(A) NAME/ ICE Y : misc_f eature 

(B) LOCATION: 855 .. 1187 

(D) OTHER INFORMATION: /note= "N signifies gap in sequence" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 


CAGATTGTTG 

CTCAAGGTCG 

AACAGTGACA 

TTTCCCTGTG 

AAACTAAAGG 

AAAC C C AC AG 

60 

CCAGCTGTTT 

TTTGGCAGAA 

AGAAGGCAGC 

CAGAACCTAC 

TTTTCCCAAA 

CCAACCCCAG 

120 

CAGCCCAACA 

GTAGATGCTC 

AGTGTCACCA 

ACTGGAGACC 

TCACAATCAC 

CAACATTCAA 

180 

CGTTCCGACG 

CGGGTTACTA 

CATCTGCCAG 

GCTTTAACTG 

TGGCAGGAAG 

CATTTTAGCA 

240 

AAAGCTCAAC 

TGGAGGTTAC 

TGATGTTTTG 

ACAGATAGAC 

CTCCACCTAT 

AATTCTACAA 

300 

GGCCCAGCCA 

ACCAAACGCT 

GGCAGTGGAT 

GGTACAGCGT 

T AC TG AAATG 

TAAAGCCACT 

360 

GGTGATCCTC 

TTCCTGTAAT 

TAGCTGGTTA 

AAGGAGGGAT 

TTACTTTTCC 

GGGTAGAGAT 

420 

CCAAGAGCAA 

CAATTCAAGA 

GCAAGG C AC A 

CTG C AGATTA 

AGAATTTACG 

GATTTCTGAT 

480 

ACTGGCACTT 

ATACTTGTGT 

GGCTACAAGT 

TCAAGTGGAG 

AGGCTTCCTG 

GAGTGCAGTG 

540 

CTGGATGTGA 

CAGAGTCTGG 

AGCAACAATC 

AGTAAAAACT 

ATGATTTAAG 

TGACCTGCCA 

600 

GGGCCACCAT 

CCAAACCGCA 

AGTCACTGAT 

GTTACTAAGA 

ACAGTGTCAC 

CTTGTCCTGG 

660 

CAGCCAGGTA 

CCCCTGGAAC 

CCTTCCAGCA 

AGTGCATATA 

TCATTGAGGC 

TTTCAGCCAA 

720 

TCAGTGAGCA 

ACAGCTGGCA 

GACCGTGGCA 

AACCATGTAA 

AGACCACCCT 

CTATACTGTA 

780 

AGAGGACTGC 

GGCCCAATAC 

AATCTACTTA 

TTCATGGTCA 

GAGCGATCAA 

CCCCAAGGTY 

840 
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TCAGTGACCC 

AAGTNAAAC C 

ACAGAAAAAC 

AATGGATC C A 

CTTGGGCCAA 

TGTCCCTCTA 

900 

CCTCCCCCCC 

CAGTCCAGCC 

CCTTCCTGGC 

ACGGAGCTGG 

AACACTATGC 

AGTGGAACAA 

960 

CAAGAAAATG 

GCTATGACAG 

TGATAGCTGG 

TGCCCACCAT 

TGCCAGTACA 

AACTTACTTA 

1020 

CACCAAGGTC 

TGGAAGATGA 

ACTGGAAGAA 

GATGATGATA 

GGGTCCCAAC 

ACCTCCTGTT 

1080 

CGAGGCGTGG 

CTTCTTCTCC 

TdCTAl v- 1 t-t- 

rpf-prp/-i/~i 7V (~*7\ r~*f~* 



114 0 

CCATCCCCAC 

GGGAAGAGAT 

GCAACCCATG 

CTGCAGGCTT 

CACCTNTTTA 

CCTCCTCTCA 

1200 

AAGACCTCGA 

CCTACCAGCC 

CATTTTCTAC 

TGACAGTAAC 

ACCAGTGCAG 

CCCTGAGTCA 

1260 

AAGTCAGAGG 

CCTCGGCCCA 

CTAAAAAACA 

CAAGGGAGGG 



1300 


(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 434 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: Modif ied- s i te 

(B) LOCATION: 285.. 396 

(D) OTHER INFORMATION: /note= "Xaa signifies gap in sequence" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Gin lie Val Ala Gin Gly Arg Thr Val Thr Phe Pro Cys Glu Thr Lys 

1 5 10 15 

Gly Asn Pro Gin Pro Ala Val Phe Trp Gin Lys Glu Gly Ser Gin Asn 

20 25 30 

Leu Leu Phe Pro Asn Gin Pro Gin Gin Pro Asn Ser Arg Cys Ser Val 

35 40 45 

Ser Pro Thr Gly Asp Leu Thr lie Thr Asn lie Gin Arg Ser Asp Ala 

50 55 60 

Gly Tyr Tyr lie Cys Gin Ala Leu Thr Val Ala Gly Ser lie Leu Ala 
65 70 75 80 

Lys Ala Gin - Leu Glu Val Thr Asp Val Leu Thr Asp Arg Pro Pro Pro 

85 90 95 

lie lie Leu Gin Gly Pro Ala Asn Gin Thr Leu Ala Val Asp Gly Thr 

100 105 110 

Ala Leu Leu Lys Cys Lys Ala Thr Gly Asp Pro Leu Pro Val lie Ser 

115 120 125 

Trp Leu Lys Glu Gly Phe Thr Phe Pro Gly Arg Asp Pro Arg Ala Thr 
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. 130 135 140 

lie Gin Glu Gin Gly Thr Leu Gin lie Lys Asn Leu Arg lie Ser Asp 
145 150 155 160 

Thr Gly Thr Tyr Thr Cys'Val Ala Thr Ser Ser Ser Gly Glu Ala Ser 

165 170 175 

Trp Ser Ala Val Leu Asp Val Thr Glu Ser Gly Ala Thr He Ser Lys 

180 185 190 

Asn Tyr Asp Leu Ser Asp Leu Pro Gly Pro Pro Ser Lys Pro Gin Val 

195 200 205 

Thr Asp Val Thr Lys Asn Ser Val Thr Leu Ser Trp Gin Pro Gly Thr 

210 215 220 

Pro Gly Thr Leu Pro Ala Ser Ala Tyr He He Glu Ala Phe Ser Gin 
225 230 235 240 

Ser Val Ser Asn Ser Trp Gin Thr Val Ala Asn His Val Lys Thr Thr 

, 245 250 255 

Leu Tyr Thr Val Arg Gly Leu Arg Pro Asn Thr He Tyr Leu Phe Met 

260 265 270 

Val Arg Ala He Asn Pro Lys Val Ser Val Thr Gin Xaa Lys Pro Gin 

275 280 285 

Lys Asn Asn Gly Ser Thr Trp Ala Asn Val Pro Leu Pro Pro Pro Pro 

290 295 300 

Val Gin Pro Leu Pro Gly Thr Glu Leu Glu His Tyr Ala Val Glu Gin 
305 310 315 320 

Gin Glu Asn Gly Tyr Asp Ser Asp Ser Trp Cys Pro Pro Leu Pro Val 

325 330 335 

Gin Thr Tyr Leu His Gin Gly Leu Glu Asp Glu Leu Glu Glu Asp Asp 

340 345 350 

Asp Arg Val Pro Thr Pro Pro Val Arg Gly Val Ala Ser Ser Pro Ala 

355 360 365 

He Ser Phe Gly Gin Gin Ser Thr Ala Thr Leu Thr. Pro Ser Pro Arg 

370 375 380 

Glu Glu Met . Gin Pro Met Leu Gin Ala Ser Pro Xaa Phe Thr Ser Ser 
385 390 395 400 

Gin Arg Pro Arg Pro Thr Ser Pro Phe Ser Thr Asp Ser Asn Thr Ser 

405 410' 415 

Ala Ala Leu Ser Gin Ser Gin Arg Pro Arg Pro Thr Lys Lys His Lys 
420 425 , 430 

Gly Gly 
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(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 444 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GCCCAGGCAG TTGCTGCAGC TGCGGAGTAT GCGGGCCTGA AAGTGGCTCG CCGCCAAATG 6 0 

CAAGATGCTG CTGGCCGCCG CCACTTCCAT GCCTCTCAGT GCCCAAGGCC CACGAGTCCT 12 0 

GTGTCCACAG ACAGCAACAT GAGTGCTGTT GTGATCCAGA AAGCCAGACC CGCCAAGAAG 18 0 

CAGAAACACC AGCCAGGACA TCTGCGCAGG GAAGCCTACG CAGATGATCT TCCACCCCCT 24 0 

CCAGTGCCAC CACCTGCTAT AAAATCGCCC ACTGTCCAGT CCAAGGCACA GCTGGAGGTA 3 00 

CGGCCTGTCA TGGTGCCAAA ACTCGCGTCT ATAGAAGCAA GGACAGATAG ATCGTCAGAC 3 60 

AGAAAAGGAG GCAGTTACAA GGGGAGAGAA GCTCTGGATG GAAGACAAGT CACTGACCTG 42 0 

CGAACAAATC CAAGTGACCC CAGA 444 

(2) INFORMATION FOR SEQ ID NO : 12 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Ala Gin Ala Val Ala Ala Ala Ala Glu Tyr Ala Gly Leu Lys Val Ala 

1 5 10 15 

Arg Arg Gin Met Gin Asp Ala Ala Gly Arg Arg His Phe His Ala Ser 

20 25 30 

Gin Cys Pro Arg Pro Thr Ser Pro Val Ser Thr Asp Ser Asn Met Ser 

35 40 45 

Ala Val Val lie Gin Lys Ala Arg Pro Ala Lys Lys Gin Lys His Gin 

50 , 55 60 

Pro Gly His Leu Arg Arg Glu Ala Tyr Ala Asp Asp Leu Pro Pro Pro 
65 70 75 80 

Pro Val Pro Pro Pro Ala lie Lys Ser Pro Thr Val Gin Ser Lys Ala 
85 90 95 
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Gin Leu Glu Val Arg Pro Val Met Val Pro Lys Leu Ala Ser lie Glu 

100 105 110 

Ala Arg Thr Asp Arg Ser Ser Asp Arg Lys Gly Gly Ser Tyr Lys Gly 

115 120 125 

Arg Glu Ala Leu Asp Gly Arg Gin Val Thr Asp Leu Arg Thr Asn Pro 

130 135 140 

Ser Asp Pro Arg 
145 
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